A PRACTICAL METHOD OF ESTIMATING AN INDIVIDUAL’S 
MAXIMAL OXYGEN INTAKE 


By J. S. Maritz, J. F. Morrison, J. Peter, N. B. Strydom and 
C. H. Wyndham 


Applied Physiology Laboratory, 
Transvaal Chamber of Mines Research Laboratories, Johannesburg 


The premises upon which simple methods for estimating an individual’s maximum 
oxygen intake are based have been tested. These premises are : 

(a) that heart rate and oxygen intake are linear functions of each other throughout 
the entire range of work up to the individual’s maximum ; 

(b) that oxygen intake of the individual deviates very little from the mean straight 
line relating oxygen intake and rate of work for the population, so that the oxygen 
intake for a task performed against gravity can be estimated with reasonable 
precision from the rate of work ; and 

(c) that the individual variability of maximum heart rate round the mean for the 
population is sufficiently small to use the mean in a routine test procedure without 
the introduction of large errors. 

Premises (b) and (c) are fully substantiated by the results reported here. 
Premise (a) is not strictly valid, as there is a bias of the order of 0.3 l. oxygen/min 
in the straight line relating heart rate and oxygen intake at high levels of work in 
most individuals. 

The errors involved in the method proposed by Astrand (1954) based on these 
premises have been examined and compared with those of an alternative procedure 
described in the present paper. The variance of maximum oxygen intake using 
the Astrand method, due to errors in measurement of heart rate alone, is of the 
order 0.53, compared with 0.26 for the procedure described in this paper. In addition, 
one measurement only of heart rate in the Astrand method introduces a bias of 
about 0.3 l. oxygen/min in the estimate of an individual maximum oxygen capacity 
of 3.0 l./min. The variance of maximum oxygen intake by the present method 
can be reduced by fitting a straight line by least squares to plots of four pairs of 
heart rate and oxygen intake values at four rates of work. 


§ 1. INTRODUCTION 


Most manual labour is carried out on aerobic metabolism—that is to say, the 
rate of oxygen utilization for the chemical processes concerned with the release 
of energy in muscle is balanced by the rate of oxygen intake. The physical 
effort required for the period of an average shift could not be sustained 
otherwise. Quantitative expression has been given to these considerations by 
experienced research workers in this field such as Christensen (1953), Brody 
(1945), Astrand (1952) and others. Their opinion is that the average level of 
effort for endurance work should not require more than half the maximum 
oxygen intake of the individual. 

An individual’s maximum oxygen intake can be determined by the method 
proposed by Astrand (1952) or by that of Taylor et al. (1955). These two test 
procedures differ in detail but are otherwise similar. The subject exercises at 
successively higher rates of work until a level is reached at which : further 
increase in the rate of work does not produce a significant increase in rate of 
oxygen intake ; the time for the onset of acute fatigue is short ; and the 
concentration of lactic acid in the blood rises sharply. These methods are 
tedious and time-consuming and, moreover, the present criteria proposed by 
these workers for assessing accurately the ‘true’ level of maximum oxygen 
intake by laboratory methods are not acceptable, as pointed out in studies by 
Wyndham et al. (1959). 

The S. African gold mining industry recruits annually a labour force of 
about 300 000 untrained natives of Southern Africa and, because it is migrant, 
labour turnover is almost 100 per cent per annum. A relatively large propor- 
tion of the mining operations are carried out with manual labour. Under 
these circumstances the measurement of the maximum oxygen intake of an 
individual recruit has potential value in placing the right man with the 
minimum of delay in the right place as far as his capacity for physical effort is 
concerned. The laboratory procedures of Taylor et al. (1955) and Astrand 
(1952) are too lengthy and costly in terms of scientific staff and apparatus to 
apply to a labour force of this size with a high rate of annual turnover. 

An alternative to the laboratory test, which has the merit of being simple 
and brief, is the procedure proposed by Astrand and Rhyming (1954). The 
maximum level of oxygen intake of an individual is estimated from a nomogram, 
using the measurement of heart rate and prediction of the rate of oxygen 
intake from the work rate at a SINGLE level of step climbing. The nomogram 
is based upon certain premises, implied but not explicitly stated. The first 
premise is that heart rate is a linear function of oxygen intake up to maximum 
rates of work ; the second, that the deviations of individual maximum heart 
rates from the mean maximum for the population are small ; and the third 
is that all the individual straight lines representing the relationship between 
heart rate and oxygen intake have a common point which for males is 
60 beats/min at zero levels of oxygen intake. A fourth premise is that the 
oxygen intake of the individual deviates very little from the mean straight 
line relating oxygen intake to rate of work for the population, so that the 
oxygen intake for a task performed against gravity, such as climbing on and 
off a step, can be estimated with reasonable precision from the rate of work. 
For step climbing the work rate can be calculated from the product of gross 
body weight, the height of the step and the rate of climbing. (This generaliza- 
tion fails to take account of energy losses over the whole cycle of stepping, 
but for the present purpose is exact enough.) 
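
The work-rate calculation just described can be sketched as follows. This is only an illustrative sketch: the function name and the example figures (a 144 lb man on a 1 ft step at 24 steps/min) are assumptions chosen for the arithmetic, and, as noted above, the product ignores energy losses over the whole stepping cycle.

```python
def step_work_rate(gross_weight_lb: float, step_height_ft: float,
                   steps_per_min: float) -> float:
    """Rate of work against gravity in ft lb/min.

    Simple product of gross body weight, step height and stepping rate;
    no allowance is made for the descent phase of each step.
    """
    return gross_weight_lb * step_height_ft * steps_per_min


# Hypothetical example: 144 lb gross weight, 1 ft step, 24 steps/min.
print(step_work_rate(144.0, 1.0, 24.0))  # 3456.0 ft lb/min
```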

If these premises are valid, then the following procedure, expressed in 
nomogram form in the paper by Astrand and Rhyming (1954), can be used to 
assess the individual maximum level of oxygen intake with reasonable 
precision :— 

Heart rate is measured at a certain rate of step climbing and from the 
average oxygen intake/work rate graph the corresponding oxygen intake 
is estimated. This is then plotted against the observed heart rate, giving 
a single point on an oxygen intake/heart rate grid. Through this point 
and the ‘common point’ a straight line is drawn and extrapolated to a 
heart rate of 128 or 196 beats/min, and the corresponding oxygen intake 
is an estimate of the half maximum or of the maximum oxygen intake 
capacity. 
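
The two-point construction described above may be sketched in code. This is a minimal illustration of the geometry only, not the published nomogram: the function name, argument names and the example values are assumptions. The line is taken through the ‘common point’ (60 beats/min at zero oxygen intake) and the single observed point, and is read off at the target heart rate.

```python
def astrand_two_point_estimate(o_est: float, h_obs: float,
                               h_target: float = 196.0,
                               h_common: float = 60.0) -> float:
    """Oxygen intake (l./min) at h_target on the line through
    (0, h_common) and (o_est, h_obs).

    o_est : oxygen intake estimated from the work rate, l./min
    h_obs : heart rate observed at that work rate, beats/min
    """
    if o_est <= 0:
        raise ValueError("estimated oxygen intake must be positive")
    if h_obs <= h_common:
        raise ValueError("observed heart rate must exceed the common point")
    slope = (h_obs - h_common) / o_est  # beats/min per l./min
    return (h_target - h_common) / slope


# Hypothetical subject: 128 beats/min at an estimated 1.5 l./min.
print(astrand_two_point_estimate(1.5, 128.0))                  # maximum
print(astrand_two_point_estimate(1.5, 128.0, h_target=128.0))  # half maximum
```

Because the whole line hinges on one measured heart rate and one assumed point, any error in either is carried directly into the slope and hence into the extrapolated value.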

From the point of view of simplicity and time Astrand and Rhyming’s 
procedure has much to commend it. However, in our view the test cannot be 
applied immediately in industry because some of the techniques are not 
acceptable and certain premises do not appear to have been tested rigidly 
enough. The unacceptable techniques are the fitting of the straight line 
through the heart rate of 60 beats/min at the zero oxygen intake and the 
plot of a SINGLE measurement of heart rate at ONE rate of work only. Heart 
rate determinations are subject to ‘individual’ random variation due to ‘error 
of measurement’ and ‘natural physiological’ causes. Hence a straight line 
fitted through two points, one of which is an assumed and perhaps biased 
value and the other subject to these variations, may have a considerable 
error. The error is reduced if three, or preferably four, measurements are 
made and especially if a straight line is fitted by the method of least squares. 

In order to test the premises on which the test procedure rests, we have 
made a number of studies to establish : 

(a) The form of the functional relationship between heart rate (H) and 
work rate (W), between oxygen intake (O) and W, and between H 
and O, under the following headings :— 

(i) whether the heart rate and oxygen intake are, to a close degree of 
approximation except for random variation, linear functions of 
the rate of work ; 

(ii) whether heart rate and oxygen intake reach some asymptotic 
maximum value at high levels of work ; 

(iii) whether, if (i) and (ii) are correct, heart rate and oxygen intake are 
linear functions of each other throughout the entire range of work 
rate up to an individual’s maximum. 

(b) The feasibility, in a routine scheme for testing recruits, of substituting 
for the actual measurements of oxygen intake values read off a mean 
population graph relating oxygen intake to work rate. 

(c) Whether the individual variability of maximum heart rate values round 
the MEAN for the population is sufficiently small to use the population 
mean in a routine test procedure without the introduction of large errors. 

It is important to draw attention to the fact that these studies were made 
in Johannesburg, at an altitude of 6000 ft above sea level. 


§ 2. METHODS 


2.1. Design 

Four African mine labourers recruited from one of the Shangaan tribes in 
Mozambique were trained to step on and off a stool 1 ft in height at a rate of 
24 steps/min. When they were trained a series of studies were made which 
lasted five months. On different days of the week, from Monday to Friday, 
they stepped at 6 steps/min, 12 steps/min and 24 steps/min for time periods 
which varied from two to six hours. The rate of work on each Monday was 
24 steps/min. For the remainder of the week the above rates were taken at 
random. A few tests were made at a rate of 30 steps/min. 
At the end of the above study the four subjects were trained to pedal at 
70 rev/min on a bicycle ergometer. When trained to this procedure, they 
worked for a total period of four months at work loads equivalent to 2500, 
5000, 7000, 8000, 8500, 9000, 9500, 10 000, 10 500 and 11 000 ft lbs/min. On 
each Monday morning the rate of work was 7000 ft lbs/min. On other days of 
the week the rate of work was taken at random. 

A group of 26 African mine recruits, chosen at random, was studied on 
recruitment to one of the mines and again one month later and a further four 
months after recruitment. The initial physiological measurements were made 
before breakfast while the men rested in bed, having slept the night in a 
hospital ward. During the morning they performed three half-hour periods 
of step climbing. The work periods followed each other without pause and, 
in succession, were 6, 12 and 24 steps/min. 


2.2. Observations 

Heart rate was determined during the initial few months by means of a 
stethoscope at the apex of the heart and was counted over half a minute. 
Subsequently, measurements were made with a portable Sanborn electro- 
cardiograph. 

Oxygen consumption was measured by the conventional Douglas bag and 
Haldane gas analysis method. Duplicate aliquot samples of expired air taken 
for analysis were required to agree to within 0.05 vol. per cent. At low levels 
of work gas collections generally were over three min, but at the very high 
levels on the bicycle ergometer collections had to be limited to 1 min. 


2.3. Subjects 
Physical characteristics of the four trained African recruits from Mozambique 
are given in Table 1. The height, weight and tribe of the 26 recruits chosen 
at random are given in Table 2. Accurate ages of these men cannot be given 
as their births are not registered. They are all young male adults in the 
early 20’s. 


Table 1. Physical characteristics of four trained men 

             On arrival              End of series 
         Height    Weight         Height    Weight 
No.      (in.)     (lb oz)        (in.)     (lb oz) 
1        624       144-5          624       146-2 
2        612       121-10         61?       118-11 
3        64        125-1          64        125-8 
4        643       127-4          643       131-8 


2.4. Apparatus 

At high levels of work the ventilation rate of the subjects reached 
150 litres/min, and respiratory valves which would offer little resistance to 
rates of flow of this order were therefore used. 

The bicycle ergometer, electromagnetically braked, was calibrated at 
approximately monthly intervals. To ensure consistency of work rate the 
pedalling rate was recorded electrically with a generator and galvanometer- 
type recorder ; the times of gas collection were marked on the record. 


§ 3. RESULTS 
3.1. Examination of the relationship between H and W, between O and W, and 
between H and O 


3.1.1. Repeated observations on four trained men at submaximal work loads 

A large number of measurements of H and O were made over a period of 
months on four trained subjects while they were stepping on and off the 
1 ft stool at rates of 6, 12, 24 and 30 steps/min. Direct estimates of residual 
variances of H and O could therefore be made and used to test whether a 
straight line is a good fit to plots of H/W, O/W and H/O, i.e. in order to test the 
first premise (a) outlined in Section 1. 


Table 2. Physical characteristics of 26 recruits 

       Height    Weight 
No.    (in.)     (lb oz)    Tribe 
7      66        125-14     - 
8      68        116-2      Basuto 
9      663       116-5      Shangaan 
10     65        118-2      Shangaan 
11     683       112-11     Shangaan 
12     63        115-6      Basuto 
13     70        142-13     Hlubi 
14     68        110-11     Bechuana 
15     64        110-10     Xosa 
16     724       128-12     - 
17     703       135-11     Bechuana 
18     613       115-5      Xosa 
19     624       120-0      Xosa 
20     654       131-2      Basuto 
21     644       107-12     Xosa 
22     653       110-8      Basuto 
23     67        127-14     Xosa 
24     67        114-12     Xosa 
25     63        112-5      Bechuana 
26     70?       119-5      Bechuana 
27     67        150-7      Basuto 
28     65        128-5      Basuto 
29     64        ?          Hlubi 
30     67        123-12     Basuto 
31     66        ?-12       Shangaan 
32     673       122-0      Xosa 


Table 3. Observed mean values of H and O and variances 

H 
                  No. 1                    No. 2                    No. 3                    No. 4 
Work rate    N    Mean    Res. var.   N    Mean    Res. var.   N    Mean    Res. var.   N    Mean    Res. var. 
6 × 12       5    73.6    84.8        5    84.8    123.2       4    70.0    53          4    88.0    42.7 
12 × 12      8    88.0    64.0        8    94.5    109.9       3    78.7    37.3        3    104.0   48.0 
24 × 12      31   110.8   53.5        39   113.7   97.6        22   96.6    22.6        22   123.6   42.5 
30 × 12      6    115.3   34.6        6    127.3   ?           3    98.7    101.3       3    118.7   69.3 

O 
                  No. 1                    No. 2                    No. 3                    No. 4 
Work rate    N    Mean    Res. var.   N    Mean    Res. var.   N    Mean    Res. var.   N    Mean    Res. var. 
6 × 12       3    0.560   0.0018      -    -       -           8    0.580   0.0108      2    0.559   0.0009 
12 × 12      4    0.943   0.0045      5    0.832   0.0069      2    0.670   0.0102      3    0.767   0.0106 
24 × 12      11   1.455   0.0168      13   1.275   0.0198      4    1.076   0.0275      6    1.181   0.0039 
30 × 12      ?    1.419   0.0258      -    -       -           -    -       -           2    1.164   0.0017 


The observed means and variances of H and O at the different rates of W 
for each individual are given in Table 3. The results at 30 steps/min were 
rejected because the mean values of H and O are no greater than those at 
24 steps/min and this indicates that the men were not able to maintain this 
high value of W. 

Straight lines were fitted through the observations of H against W and 
O against W by the method of least squares. This gives estimates α̂ and β̂ of 
the parameters α and β in the linear relationship 

x = α + βW, 

where x is the unknown H or O and W has the designation as in the text. 
The values of α̂ and β̂ for each subject and their standard errors (s.e.) are given 
in Table 4. 


Table 4. Estimates of parameters of straight lines fitted to H, W and O, W 

                     H, W                                      O, W 
No.    α         s.e.(α)   β         s.e.(β)       α         s.e.(α)   β          s.e.(β) 
1      65.298    3.174     0.07652   0.006056      0.6420    0.801     0.001273   0.002963 
2      75.109    3.912     0.08076   0.008745     -0.5117    0.715     0.001552   0.003567 
3      63.848    2.952     0.06340   0.006462     -0.3892    0.624     0.001362   0.003301 
4      85.961    4.511     0.07086   0.009717     -0.4229    0.650     0.001378   0.003244 


The straight lines were tested for goodness of fit by means of the 
variance-ratio test, resulting in the Snedecor ‘F’ as test criterion. Values of 
F, and the probability of their being exceeded if the hypothesis (of linearity) 
under test is true, are given in Table 5 and they indicate that, on the whole, 
the straight lines fit satisfactorily. 
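
A variance-ratio test of this kind can be sketched as follows, assuming the usual lack-of-fit form: the scatter of the group means about the fitted line is compared with the pooled scatter of the repeated observations within each work rate. The function name and the data are hypothetical; the paper does not list its computing steps. With four work rates the numerator has 4 - 2 = 2 degrees of freedom, which agrees with the H, W entries of Table 5.

```python
from collections import defaultdict


def lack_of_fit_f(w, x):
    """Return (F, df_lack, df_pure) for the line x = a + b*w.

    w : work rates, with repeated observations at each rate
    x : corresponding observations of H or O
    """
    n = len(w)
    w_mean = sum(w) / n
    x_mean = sum(x) / n
    sww = sum((wi - w_mean) ** 2 for wi in w)
    swx = sum((wi - w_mean) * (xi - x_mean) for wi, xi in zip(w, x))
    b = swx / sww
    a = x_mean - b * w_mean

    groups = defaultdict(list)
    for wi, xi in zip(w, x):
        groups[wi].append(xi)

    ss_pure = 0.0  # within-rate (residual) sum of squares
    ss_lack = 0.0  # weighted squared distance of group means from the line
    for wi, xs in groups.items():
        g_mean = sum(xs) / len(xs)
        ss_pure += sum((xi - g_mean) ** 2 for xi in xs)
        ss_lack += len(xs) * (g_mean - (a + b * wi)) ** 2

    df_lack = len(groups) - 2
    df_pure = n - len(groups)
    if df_lack < 1 or df_pure < 1 or ss_pure == 0.0:
        raise ValueError("need at least three work rates and replicated data")
    return (ss_lack / df_lack) / (ss_pure / df_pure), df_lack, df_pure


# Hypothetical heart rates, two observations at each of four stepping rates.
rates = [6, 6, 12, 12, 24, 24, 30, 30]
hearts = [70, 74, 82, 86, 106, 110, 118, 122]
print(lack_of_fit_f(rates, hearts))
```

In this made-up example the group means lie exactly on a straight line, so the ratio is zero; a large F relative to the tabulated value for the stated degrees of freedom would indicate departure from linearity.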


Table 5. Test of goodness of fit based on Snedecor ‘F’ value, together with probability (P) 
of given value being exceeded 

              H, W                        O, W 
No.    F        d.f.     P          F        d.f.     P 
1      2.434    2, 46    > 0.10     2.436    4, 17    > 0.10 
2      0.400    2, 54    > 0.50     4.416    3, 13    > 0.05 
3      1.976    2, 28    > 0.10     0.439    1, 16    > 0.50 
4      7.854    2, 28    < 0.01     0.000    ?        > 0.50 


As the O, W and H, W relationships are linear the same must be true of the 
O, H relationship, and its constants can be computed from those of the O, W 
and H, W straight lines. These straight lines with their 90 per cent confidence 
bands are given in Fig. 1. The limits are very wide at the high levels of H 
because there were few observed H values above 140 beats/min. 


3.1.2. One measurement of H and O on 26 men at rest and at three rates of W 

H was measured at 5 min intervals and O once during rest and during a 
30 min spell of work at each of three values of W, viz., 6, 12 and 24 steps/min. 
Through each set of four points (plots of H or O against W) a straight line was 
fitted by least squares. By taking differences between observed and fitted 
values, squaring, adding and dividing by the appropriate number of degrees 
of freedom (two in this case), an estimate of the residual variance, based on 
two degrees of freedom, was found for each of the 26 men. The means of 
these residual variances are given in Table 6 for comparison with the residual 
variances found on the four trained men. 
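
The computation described in this paragraph can be sketched as below. The function name and the numbers are illustrative assumptions; with four points and two fitted parameters the divisor is the two degrees of freedom mentioned in the text.

```python
def fit_line_with_residual_variance(w, x):
    """Least-squares line x = a + b*w and its residual variance.

    Returns (a, b, residual_variance), the variance being the sum of
    squared differences between observed and fitted values divided by
    (number of points - 2).
    """
    n = len(w)
    if n < 3:
        raise ValueError("at least three points are needed")
    w_mean = sum(w) / n
    x_mean = sum(x) / n
    sww = sum((wi - w_mean) ** 2 for wi in w)
    swx = sum((wi - w_mean) * (xi - x_mean) for wi, xi in zip(w, x))
    b = swx / sww
    a = x_mean - b * w_mean
    resid_ss = sum((xi - (a + b * wi)) ** 2 for wi, xi in zip(w, x))
    return a, b, resid_ss / (n - 2)


# Hypothetical subject: rest (0) and 6, 12, 24 steps/min, with heart rates.
a, b, var = fit_line_with_residual_variance([0, 6, 12, 24], [62, 75, 88, 112])
print(a, b, var)
```

Averaging the third returned value over all subjects gives a mean residual variance of the kind reported in Table 6.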


Table 6. Mean residual variances of 26 men 


H = 23.1 
O = 0.00784 

Tests of goodness of fit similar to those for the four men could not be 
carried out but, if H and O are both linearly related to W, they are also linearly 
related to each other. Therefore inspection of plots of H against O should 
enable one to make a reasonable judgment as to the validity of the hypothesis 
of linearity. Such plots, for each of the 26 men, are given in Fig. 2, and they 
suggest strongly that the hypothesis is acceptable. 


Figure 1. Straight line, with confidence limits, fitted to plots of heart rates against oxygen 
intakes for trained subjects Nos. 1-4. 

Figure 2. Straight lines fitted to plots of heart rates against oxygen intakes of 26 untrained 
recruits. (Axis label: heart rate, beats/15 sec.) 


Figure 2 also shows that the straight lines do not have a common point 
at resting oxygen intakes. An objective test of this observation is available 
because the variance between subjects of resting values of O and H may be 
compared with the residual variances given in Table 6. We have the following 
results : 

Between subjects variance at resting O = 0.00706. 
Between subjects variance at resting H = 160.00 

and comparing these with the residual variances we have 

F (O) = 0.90 
F (H) = 6.64 

The differences between resting oxygen intakes are, therefore, not significant, 
but for resting heart rates they are very significant. This definitely refutes the 
suggestion that the H, O graphs all have a common point. 


3.1.3. Repeated measurements of H and O at different levels of W up to the 
individual maximum 

Repeated measurements of H and O were made at a number of values 
of W up to the individual maximum on the same four trained subjects plus 
an additional two, using a bicycle ergometer. Direct estimates of residual 
variances could therefore be made and these could, in turn, be used to test the 
goodness of fit of the mathematical model. (Ref. Table 11 p. 114.) 

The mathematical model used to represent the plots of the data of H 
against W and O against W, and also the tests for goodness of fit of the model 
have been detailed in a separate paper by Wyndham et al. (1959). It suffices 
here to state that the model which was found to fit the data well was one with 
a linear component up to the 7750 ft lbs/min level of W and an exponential 
component above that value of W. The curve relating H to O for each individual 
was produced by taking a number of values of W up to the individual's 
maximum and reading off the corresponding H and O values from the respective 
fitted curves. These ‘ estimated mean’ values of H and O were then plotted 
against each other and are given in Fig. 3 for the six trained subjects. At high 
levels of W there is a small departure from linearity due in most cases to the 
fact that, as W approaches the individual maximum, O tends to its asymptote 
more slowly than H. This means that the value of O obtained by extrapolating 
the straight line to intersect the maximum H value is usually less than the 
maximum measured O or the value of O at the asymptote. Some idea of the 
magnitude of the ‘bias’ may be gained from Fig. 3, where the straight line 
has been extended to intersect the maximum value of H at point R. S corresponds 
to measured O at the highest levels of W and M to the value of O at 
the asymptote. The distances R-S and R-M are given in Table 7. Two 
values are given for subject No. 4. On the first occasion he was tested he did 
not exert himself to a level of W at which the asymptote of O corresponded 
to the highest observed values of O. On the second series of experiments one 
year later he worked harder and the observed and asymptotic values of O 
correspond closely. Both values are given as an illustration of one of the 
difficulties that can arise in estimating the asymptote of O where a subject 
does not exert himself to the utmost. 


Table 7. Bias of maximum oxygen intake 

No.           R-S       R-M 
1             0.22      0.28 
2             0.21      0.35 
3             0.33      0.38 
4             0.52      0.98 
4 (Repeat)    0.101     0.156 
5             0.086     0.201 
6            -0.260    -1.258 


This analysis shows that the relationship between H and O can be represented 
by a straight line up to very near the maximum values of W, but that 
there is a small ‘bias’ towards observed and asymptotic values of O higher 
than would be obtained by extrapolating the straight line part of the graph 
to the maximum value of H and reading off the corresponding value of O. 


Figure 3. Pairs of heart rates and oxygen intake values between 2500 and 12 000 ft lb/min 
are taken from the curves fitted to O, W and H, W observations. The above curves are 
fitted to these pairs of ‘mean’ H and O values for subjects Nos. 1-6. Point R is the 
extrapolation of the straight line part of the fitted curve to the individual’s maximum 
heart rate ; M is the asymptote value of O for the individual’s O, W curve ; and S is the 
mean of the three highest measured values of O. The distances R-M and R-S represent 
the bias between asymptote (M) and measured values of maximum oxygen intake (S) 
and that obtained by extrapolating H values at submaximal work loads plotted against 
O values. 


3.2. Mean maximum values and variances of H for a population 

Repeated measurements of H were made at various levels of W up to the 
individual maximum effort on the six trained men. Appropriate curves were 
fitted and the goodness of fit tested using the residual variances. Maximum 
values of H at the asymptote correspond closely to the observed mean 
maximum values of H at the three highest values of W when at least five 
observations were made on each individual (Subjects 1, 2, 3, 4, Table 8). 
When fewer measurements were made (as in the case of Subjects 4 (repeat), 
5 and 6, Table 8) the standard error of the means is greater. In the latter the 
differences between the asymptotic and observed mean maximum H values 
were greater, although these differences are not significant. The mean 
asymptotic value of H for the six trained men is 184.1 beats/min with a 
standard deviation of 8.4. 

A further study was made on eight men trained at submaximal work loads 
on stepping stools but not to maximal loads on the bicycle ergometer. They 
were required to work up to a rate at which either H on two successive values 
of W was clearly assuming an exponential form, or at a rate at which they were 
exhausted within 5 min. These values are also given in Table 8. 


Table 8. Comparison of asymptote with measured maximum H 

Trained       Asymptote   s.e. of      Mean H at        s.e. of    Untrained   Mean H at 
men           H           asymptote    3 highest W’s    mean       men         3 highest W’s 
1             178.94      1.39         179.29           1.23       No. 33      187 
2             180.17      1.87         179.28           1.53       34          177 
3             171.09      0.93         171.00           1.54       35          178 
4             182.98      1.02         182.26           0.80       36          170 
4 (Repeat)    190.63      19.00        174.86           2.67       37          178 
5             190.40      9.30         188.12           2.22       38          186 
6             194.28      7.26         177.93           1.03       39          184 
                                                                   40          175 

Mean          184.07                                                           179.38 


The values of the means of the highly trained and the relatively untrained 
men are not significantly different, and are so close to 180 beats/min that it 
may be regarded as a satisfactory working value for young healthy Bantu 
males. 


3.3. Estimate of rates of oxygen intake 

3.3.1. Estimate of oxygen intake from rate of work 

The aim of this part of the study was to examine the magnitude of the 
individual variations around the population mean O, W line. Thus one could 
decide whether it would be feasible, in routine practice, to estimate a man’s O 
simply by entering his W in the equation 

O = A + BW. 

Each individual in the population has his own O, W straight line, O = α + βW 
say, and A and B are the population means of the α’s and β’s respectively. 
The success of this proposal depends upon the variations between the α’s and 
β’s from one individual to another. The random error in estimating O from 
a ‘standard’ O, W graph has been calculated (Appendix 6.2) and is less than 
the variances of measured O due to error and physiological variation, being of 
the order 0.002 compared with 0.02, the mean of the pooled variances of ‘O’ 
of the four subjects (Table 9). 


Table 9. Pooled variances of H and O for each subject 

Figure 4. Values for the parameters α and β in the equation for a straight line, O = α + βW, were 
obtained from the straight lines fitted to four pairs of O and W values at four submaximal 
rates of work on 26 recruits. The straight lines, with 95 per cent confidence limits, in this 
Figure are drawn from the mean of the 26 α and β values. Series I is on recruitment ; Series II 
after one month of work ; Series III after a further three months of work. 


The above estimates were made on new recruits to the mining industry. 
Before the procedure could be recommended on a wide scale it was important 
to see whether the same linear relationship applies in other circumstances, for 
example, to men who have been some time at work in the mines. In Fig. 4 
is given the mean straight line fitted to the individual O, W graphs of the 
26 men, together with 90 per cent confidence limits ; in addition there are 
drawn similar straight lines on O, W data after one month and after a further 
three months of service. The parameters of the straight lines are so similar 
that one straight line could be used with little error to estimate O from W for 
all periods of service. 


3.3.2. Prediction of O from a measurement of ventilation (V) 


Estimation of O from determination of W alone has been considered. If 
observation of V were used in conjunction with W it might be expected that 
a more precise estimate of O would be obtained. If incorporation of V resulted 
in greatly increased precision it would be a worthwhile practical proposition, 
as its determination does not involve very complicated laboratory procedures. 

Calculation of variances of α and β in the eqn. 

V = α + βW 

enables us to compare standard deviations of O estimated either indirectly 
from the relationship between V and W or directly from O against W. The 
standard deviation of the former is approximately 0.9 that of the latter 
(see Appendix 6.3). 

It is clear from this that the improvement in accuracy in measuring V 
compared with the prediction of O from W would not warrant the extra effort 
involved. 


3.3.3. Comparison of asymptotic value of O and average O at highest levels of W 
with O corresponding to maximum H value from extrapolation of H/O line 


From the point of view of the test procedure proposed it is important to 
compare : 
(i) the asymptotic value of O from the O against W curve using the 
bicycle ergometer data ; 
(ii) the mean O at the three highest levels of W from men on the bicycle 
ergometer ; 

(iii) the value of O which corresponds to the maximum H when the straight 
line fitted to the plots of H against O is extrapolated both for the data 
obtained on stepping stools and on the bicycle ergometer. 

These results are given in Table 10. They show that the O values corresponding 
to the maximum level of H on the straight line graph of O, W are 
closely similar whether the bicycle or a step test is employed. The O values 
are, however, generally less than either the highest measured value of O or the 
asymptotic value in the O, W curve. 
Another feature of the results is the very wide confidence interval of O at 
the maximum level of H, as determined from the stepping test data. This is 
due to the very considerable extrapolation of the confidence limits above the 
highest measured value of H in the tests (about 140 beats/min). This situation 
could clearly be improved by increasing the work load either by raising the 
height of the step or adding a weight load to the individual or by increasing 
the rate of stepping. The last of these alternatives proved impractical, as is 
seen in Table 3, where values of H and O for 24 and 30 steps/min were not 
distinguishable. 




Table 10. Comparison of maximum oxygen capacities determined by four different procedures 

              Mean O at         O at the          Maximum O         Maximum O 
No.           3 highest W’s     asymptote         bicycle           step test 
              (a)               (b)               (c)               (d) 
1             3.01              3.11              2.87              2.55 
              (2.75-3.27)       (2.99-3.23)       (2.60-3.14)       (2.11-2.99) 
2             2.72              2.90              2.56              2.53 
              (2.62-2.82)       (2.64-3.16)       (2.31-2.81)       (1.97-3.09) 
3             3.01              3.11              2.92              2.89 
              (2.84-3.18)       (2.97-3.25)       (2.66-3.18)       (2.01-3.77) 
4             3.13              3.64              2.66              2.25 
              (2.97-3.29)       (3.06-4.22)       (2.57-2.78)       (1.81-2.69) 
4 (Repeat)    2.81              3.06              2.66              - 
              (2.59-3.02)       (2.65-3.46)       (2.47-2.85) 
5             2.84              3.00              2.93              - 
              (2.65-3.02)       (2.74-3.27)       (2.84-3.02) 
6             2.55              2.71              2.35              - 
              (2.38-2.72)       (2.66-2.77)       (2.25-2.44) 


§ 4. DISCUSSION 
4.1. Examination of the method of Astrand and Rhyming 

These results have clarified the position with regard to the premises outlined
in Section 1. Two of the premises are fully substantiated. They are that,
for a task performed against gravity, O can be predicted from W with a variance
that is of the same order as the day-to-day variability in direct measurement
of O; and that the variation of an individual maximum H from the mean for a
population is small, so that for young healthy Bantu males a mean maximum
of 180 beats/min can be employed in any test procedure without introducing
significant errors.

This study did, however, show that one of the premises is not strictly valid.
A straight line is a good representation of the relationship between H and O
over most of the range of work rates, but at levels of W near the individual
maximum the relationship departs sharply from linearity, so that the measured
maximum O or the asymptotic value of O is generally higher than would
correspond to the maximum H value when the straight line fitted to plots of
H, O is extrapolated to that value. The extent of the ‘bias’ is, however, small.

The most important shortcoming, brought out by this study, in Astrand
and Rhyming’s proposed procedure is that the straight lines fitted to the
plots of four pairs of H and O values for each of the 26 men do not have a
common point. The differences between resting O’s in the 26 men were not
significant, but the differences between the H’s were very significant. Astrand
and Rhyming’s procedure is to draw a straight line between the common
point, viz., 60 beats/min at zero O, and the measured H plotted against
estimated O for a single rate of stepping. Their method has therefore a number
of possible sources of error. The first is the ‘bias’ of the common point
compared with the individual’s measured H at rest; the second is the variance
of H at rest; the third is the random error in measurement of H during
stepping; and the fourth is the ‘bias’ of the true O during stepping compared
with that estimated from the population mean. Only the last two of these
sources of error affect the estimate as proposed in this paper. The cumulative
effect of all errors on the precision of the estimate of O is examined for both
methods in the succeeding sections.

4.2. Alternative method for estimating the individual H/O straight line

The unsatisfactory features of the Astrand and Rhyming method may be
eliminated, firstly, by omitting H’s which are not actually measured and,
secondly, by using additional work rates in order to obtain more data for
fixing an individual line. Our proposal is as follows: select four rates of work
and determine the subject’s H at each of them, estimate his four O’s from the
average O, W line, fit a straight line through the four O, H points, extrapolate
it to H = 180, and find the corresponding O as an estimate of maximum O.
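
The proposal above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the slope rule (first two points contrasted with the last two, line passed through the mean point) is the simple rule given in Appendix 4, but the work rates, heart rates and the O, W line coefficients below are hypothetical stand-ins (the coefficients happen to be those listed for Subject No. 28 in Table 13, used here in place of the population line, which is not quoted in this section).

```python
def predict_oxygen_intake(work_rate, intercept, slope):
    """Estimate O (l./min) from work rate W using a straight line O = intercept + slope * W."""
    return intercept + slope * work_rate


def estimate_max_oxygen(heart_rates, oxygen_intakes, max_heart_rate=180.0):
    """Extrapolate a straight line through four (H, O) points to the assumed maximum H.

    The slope contrasts the sum of the first two points with the sum of the
    last two, and the line is made to pass through the mean point.
    """
    if len(heart_rates) != 4 or len(oxygen_intakes) != 4:
        raise ValueError("exactly four (H, O) pairs are expected")
    h1, h2, h3, h4 = heart_rates
    o1, o2, o3, o4 = oxygen_intakes
    slope = ((o1 + o2) - (o3 + o4)) / ((h1 + h2) - (h3 + h4))
    intercept = sum(oxygen_intakes) / 4.0 - slope * sum(heart_rates) / 4.0
    return intercept + slope * max_heart_rate


# Hypothetical inputs, for illustration only.
WORK_RATES = [300.0, 500.0, 700.0, 900.0]    # ft lb/min (assumed)
HEART_RATES = [100.0, 115.0, 130.0, 145.0]   # beats/min (assumed)
LINE_INTERCEPT, LINE_SLOPE = 0.330, 0.00227  # stand-in for the average O, W line

oxygen = [predict_oxygen_intake(w, LINE_INTERCEPT, LINE_SLOPE) for w in WORK_RATES]
max_o = estimate_max_oxygen(HEART_RATES, oxygen)
print(round(max_o, 2))
```

Because only heart rate is measured, the four O values all lie on the assumed O, W line; the subject's individuality enters solely through the measured H's.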

It is important to make an assessment of the random error associated with 
the final maximum O determined in this way. There are various sources of 
error whose effects are, generally, cumulative, and they are : 

(i) random observer errors in measuring H;
(ii) day-to-day physiological variations in H and O at given W;
(iii) the error introduced by estimating O from the ‘standard’ graph of
O, W instead of measuring O directly;
(iv) the use of a single value (approximately 180) as maximum H for all
subjects.

Many estimates of the combined effect of (i) and (ii) are given in this paper 
(e.g. Tables 9, 6 and 11) and, as a representative value for specimen calculations 
of var (O max) the figure var (H) =40 was used. 


Table 11. Pooled residual variances for each subject

                                                       Pooled
No.          χ²            d.f.          P                  Res. Var.   Asymptote

Heart rate
1            10.7          9             0.30               33.36       178.9
2            4.4           [illegible]   0.70               43.32       180.2
3            20.8          10            <0.025 >0.01       44.05       [illegible]
4            22.5          7             <0.01              34.74       183.0
4 (repeat)   13.0          5             <0.025 >0.01       107.45      190.63
5            21.3          8             <0.01 >0.005       18.65       194.3
6            36.5          7             <0.0005            78.81       190.4

Oxygen intake
1            7.8           9             >0.50              0.0164      3.106
2            9.2           [illegible]   >0.20              0.0090      2.901
3            15.1          10            >0.10              0.0297      3.108
4            6.8           [illegible]   >0.40              0.0267      3.640
4 (repeat)   11.5          7             >0.10              0.0322      3.055
5            [illegible]   8             <0.025 >0.01       0.0230      3.002
6            12.9          7             <0.1 >0.05         0.0212      2.714



As regards (iii) and (iv) it is important to note the following: if measurements
of O and maximum H are actually made an unbiased estimate of
maximum O can be made. This means that if the procedure were repeated
a large number of times on the same individuals the average of all
determinations will approach ‘true’ maximum O more and more closely as the
number of determinations is increased.

When O (and maximum H) are determined from a population average value
the situation is quite different. The estimate of maximum O is almost
certainly biased, i.e. repeated determinations of maximum O and averaging
do not ensure that true maximum O is approached more and more closely.
[…] to include true maximum O with given probability. These limits are not
confidence limits in the usual sense; their width does not decrease to zero with
increasing number of observations.

Estimated maximum O and its standard deviation were calculated by the
method outlined in the Appendix for six subjects and the results are given in
Table 12. The values of standard deviation of maximum O were plotted
against estimated maximum O and it will be seen in Fig. 5 that the points
cluster closely about a smooth curve. It was assumed that the standard
deviation of maximum O may be represented with sufficient accuracy by a
simple mathematical function of maximum O and that the smooth curve drawn
in Fig. 5 is, for present purposes, a reasonable approximation to this function.

Figure 5. Standard deviations of maximum oxygen intake are given for men with various levels
of estimated maximum oxygen intake. A smooth curve has been fitted to the values.

Table 12. Estimated maximum oxygen intake and standard deviations of six untrained subjects

No.     Om      s.d. Om
10      2.09    0.248
11      1.47    0.122
15      1.71    0.176
20      3.28    0.660
24      2.64    0.456
28      3.12    0.507

These results are used in the following routine classification scheme. Using
the standard deviations in Table 12 and assuming normality of the distributions
involved, the graphs in Fig. 6 were plotted. These are interpreted as follows:
the dotted curve decreasing to the left of its starting point (estimated maximum
O) on the top scale gives the probability that the value along the bottom
scale exceeds the true value of maximum O, while the solid curve decreasing
to the right of its starting point gives the probability that the value along the
bottom scale is less than the true value of maximum O.
Two examples of how these curves may be used are:
(i) Suppose that subjects are to be selected for some fairly arduous tasks
for which it is considered unsafe or undesirable to have a maximum O
of less than 1.5 l./min. At the same time as large a number of men as
possible must be accepted for these tasks. This means that, with the
lower limit of 1.5, an upper cut-off O₀ must be chosen so that it is the
smallest value giving a reasonably small chance of a man whose true
maximum O is 1.5 being selected for the hard tasks. If we want this
chance to be 0.05 or less we find that O₀ = 1.9 would be satisfactory
because the probability of true maximum O being less than 1.5, given
that the estimated maximum O is 1.9, is 0.034.
(ii) Suppose that two limits 2.0 and 3.0 are decided upon. Those men
with estimated maximum O < 2.0 are assigned to light tasks and those
with estimated maximum O > 3.0 […] having true maximum O < 2.0 is
2.7/100, and if more information about the distribution of estimated
maximum O is available the overall risk may be calculated. The
‘waste’ if the accent is on high-capacity men is represented by the
probability that a man with estimated maximum O < 2.0 has true
maximum O > 3.0 and is 0.001 in this case.
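
The probability quoted in example (i) can be checked roughly under the normality assumption stated for Fig. 6. The sketch below is not the authors' calculation: the standard deviation of 0.22 l./min at an estimated maximum O of 1.9 is an assumed reading of the smooth curve in Fig. 5 (Table 12 gives 0.176 at 1.71 and 0.248 at 2.09), and the function names are hypothetical.

```python
import math


def normal_cdf(z):
    """Cumulative distribution function of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def prob_true_below(limit, estimate, sd):
    """P(true maximum O < limit), treating the true value as normally
    distributed about the estimate with the given standard deviation."""
    return normal_cdf((limit - estimate) / sd)


# Assumed s.d. of 0.22 l./min at an estimate of 1.9 l./min (read off Fig. 5).
risk = prob_true_below(limit=1.5, estimate=1.9, sd=0.22)
print(f"{risk:.3f}")
```

With that assumed standard deviation the risk comes out between 0.03 and 0.04, in line with the figure quoted in example (i); a different reading of the curve would shift it accordingly.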


4.3. Comparison of errors in estimates of maximum O by Astrand and Rhyming’s 
method and by the present procedure 

One of the main sources of error in estimating maximum O by the method
of Astrand and Rhyming is in using a point of common origin as one of the two
plots of H against O to which a straight line is fitted. The study on 26 men
brought out the fact that while the O’s at rest are not significantly different,
the resting H values are very significantly different. Hence using a common
point introduces an additional random error. The error in estimating maximum
O by Astrand’s method due to the variance ‘between subjects’ of resting H
values alone was calculated by the method given in Appendix 6.5 on Subject
No. 28. Using the variance of H₁ of 120 (beats/min)², then

Variance of maximum O = 0.53, s.d. 0.73 (Astrand and Rhyming method)
                      = 0.26, s.d. 0.51 (present method)

In addition, the Astrand and Rhyming method has a bias due to the common
point having been chosen as a heart rate of 60 beats/min at zero oxygen intake
instead of using the population mean H and O values. As an example, in a
man of 3 l./min maximum oxygen capacity, the bias could be of the order of
0.2 to 0.4 l./min, depending upon the value of the heart rate observed.

The further point should be made that in the present study the intention 
has been that the procedures in the practical testing of Native recruits should 
be carried out by non-scientific personnel. It is, naturally, relatively simple 
to recognize a gross error in heart rate measurement when four values at 
different rates of work are plotted against estimated oxygen intake. When 
only one measurement of H is made at a single rate of step climbing such gross 
errors cannot be observed. 

The method of fitting a straight line to plots of four measured H values 
and predicted O values was proposed also with a view to maintaining simplicity 
in the procedure. This proposed method is less accurate than fitting a straight 
line by least squares. When, however, accuracy greater than the quoted 
variances of maximum O is required, then fitting a line by least squares is the 
method of choice. 
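
For comparison, the least-squares alternative mentioned above can be sketched as follows. The four (H, O) pairs are hypothetical values chosen only to make the example run, and the function names are not from the paper; 180 beats/min is the population maximum heart rate adopted in the text.

```python
def least_squares_line(x, y):
    """Return (intercept, slope) of the ordinary least-squares line of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def max_oxygen_least_squares(heart_rates, oxygen_intakes, max_heart_rate=180.0):
    """Fit O on H by least squares and read off O at the assumed maximum H."""
    intercept, slope = least_squares_line(heart_rates, oxygen_intakes)
    return intercept + slope * max_heart_rate


# Hypothetical heart rates (beats/min) and estimated oxygen intakes (l./min).
H = [100.0, 115.0, 130.0, 145.0]
O = [1.05, 1.42, 1.95, 2.35]
estimate = max_oxygen_least_squares(H, O)
print(round(estimate, 2))
```

Unlike the grouped rule, the least-squares slope weights each point by its distance from the mean heart rate, which is why it uses the four observations more efficiently.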

§ 5. Conclusions

5.1. Oxygen intake of an individual can be estimated with a coefficient of 
variation of 6 per cent from a population mean straight line relating oxygen 
intake for various rates of stepping to rate of work. This value is of the same 
order of magnitude as the residual variance of oxygen intake due to day-to-day 
variations and observer errors. The random error in estimating oxygen intake 
from a ‘ standard ’ oxygen intake/work rate graph is therefore no greater than 
the variances in measured oxygen intake due to error and physiological varia- 
tion. Measuring the ventilation volume during work does not improve the 
accuracy of the estimate of oxygen intake by more than a small percentage 
and the procedure is not therefore warranted. 

5.2. The individual variability of maximum heart rate about the population
mean is small. The mean for six trained men is 184.1 beats/min with a standard
deviation of 8.4; and of untrained men it is 179.3 beats/min with a standard
deviation of 5.9. The means are therefore not significantly different. Taking
a maximum heart rate of 180 beats/min as the population mean for Bantu
introduces a very small error in the determination of maximum oxygen intake.

5.3. Heart rate is a linear function of oxygen intake over most of the range
of work up to the individual maximum. Near the maximum, however, there
is a sharp departure from linearity in most individuals, so that the maximum
oxygen intake actually measured is greater by about 0.3 l./min than that
estimated by extrapolating the straight line to the maximum heart rate level
and reading off the corresponding oxygen intake value on the graph.

5.4. These conclusions are the basic premises upon which various rapid
methods of estimating an individual’s maximum oxygen intake are founded.
The detailed procedures differ, but the essence of the methods is similar. The
procedure is that heart rate is measured at one or more rates of work; oxygen
intake is estimated from a ‘standard’ graph relating oxygen intake to rate of
work; pairs of heart rate and oxygen intake values for various rates of work
are plotted on a heart rate/oxygen intake ‘grid’; a straight line is fitted and
extrapolated to the maximum heart rate value and the corresponding oxygen
intake value is an estimate of the individual’s maximum oxygen intake. The
procedure proposed by Astrand and Rhyming (1954) is based on a single
measurement of heart rate at one rate of work. It has four possible sources
of error. The heart rate value at rest may be biased; it has a random error;
measurements of heart rate during work also have a random error; estimating
oxygen intake from a ‘standard’ graph has a random error. Some of these
errors are eliminated by a procedure proposed by us which is based on fitting
a straight line to the plot of four pairs of heart rate and oxygen intake values.
The errors of these two methods have been compared. The standard deviation
of maximum oxygen intake of a test subject using Astrand and Rhyming’s
method was 0.73, compared with a standard deviation of 0.51 for our method.
In addition, Astrand and Rhyming’s method has a systematic bias of about 0.3 l.
oxygen intake/min. The accuracy of estimation by the present method could
be improved by fitting the straight lines by least squares.


The hypotheses on which simple methods of estimating an individual’s maximum
oxygen intake are based have been tested experimentally. These hypotheses are:
(a) Heart rate and oxygen intake are linear functions of each other at all rates
of work up to the individual’s maximum.
(b) The oxygen intake of an individual differs very little from the mean straight line
expressing, within the population, the relation between oxygen intake and work,
so that the oxygen intake for a task performed against gravity can be estimated
with good precision from the work done.
(c) The individual variability of maximum heart rate about the population mean is
small enough to allow this mean to be used in a routine exercise test without
introducing large errors.

Hypotheses (b) and (c) are fully verified by the results reported here. Hypothesis (a)
is not strictly valid, because of a departure of the order of 0.3 l. oxygen/min from
the linear relation between heart rate and oxygen intake at high levels of work in
most individuals.

The errors attributable to the method proposed by Astrand (1954), based on these hypotheses,
have been examined and compared with those of the alternative version of the method described
in the present paper. The variance of maximum oxygen intake by the Astrand method,
due to errors in measurement of heart rate alone, is of the order of 0.53, compared with
a variance of 0.26 by the procedure described in this paper.

In addition, a single measurement of heart rate in the Astrand method introduces an
error of about 0.3 l. oxygen/min in the estimate of a maximum aerobic oxygen intake
of 3.0 l./min. The variance of the maximum oxygen intake predicted by the present
method can be reduced by using the least-squares line fitted to the four experimental
points corresponding to the heart rates and oxygen intakes observed at four different
levels of work.


STATISTICAL APPENDIX

1. Variability of H and O at Different Levels of W

Step Tests
The effect of different rates of W on the ‘residual variability’ of H and O
is small and irregular, although there appeared to be a trend in some instances.
Bartlett’s test was used to test the residual variances for homogeneity and
the values of χ², the test criterion, are given in Table 9. They indicate that
the hypothesis of uniform residual variation is reasonable, therefore the
individual residual variances were pooled. The pooled values are also given in
Table 9, and it seems that the residual variance of H is characteristic of the
individual whereas that of O is not, because testing, again by Bartlett’s test,
between individuals gives the results
Oxygen intake χ² = 4.51 (not significant)
Heart rate χ² = 15.13 (very significant)
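
For readers who wish to repeat this kind of check, Bartlett's criterion can be computed from a set of variance estimates and their degrees of freedom as sketched below. The function name and the example variances are hypothetical; the data behind the χ² values quoted above are not reproduced in this section.

```python
import math


def bartlett_statistic(variances, dfs):
    """Bartlett's test criterion for homogeneity of variances.

    variances: sample variances s_i^2; dfs: their degrees of freedom.
    Returns (statistic, pooled_variance). Under homogeneity the statistic is
    approximately chi-squared with len(variances) - 1 degrees of freedom.
    """
    k = len(variances)
    total_df = sum(dfs)
    pooled = sum(v * d for v, d in zip(variances, dfs)) / total_df
    numerator = total_df * math.log(pooled) - sum(
        d * math.log(v) for v, d in zip(variances, dfs)
    )
    correction = 1.0 + (sum(1.0 / d for d in dfs) - 1.0 / total_df) / (3.0 * (k - 1))
    return numerator / correction, pooled


# Hypothetical residual variances of heart rate for three subjects, 8 d.f. each.
stat, pooled_variance = bartlett_statistic([30.0, 45.0, 110.0], [8, 8, 8])
print(round(stat, 2), round(pooled_variance, 2))
```

A large statistic relative to the chi-squared distribution argues against pooling; identical variances give a statistic of zero.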


Bicycle Ergometer Tests
Tests of homogeneity of the residual variances of H and O were carried out,
and Table 11 contains the results of the test of homogeneity and pooled residual
variances for H and O for each of these six individuals. Some of the individual
values of χ² are high but plotting the individual variances against W showed
no clear trend, so that pooling of the residual variances appears to be justifiable.


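2. Estimates of Variances and Covariances of O|W for 26 Men
Estimates of α₂’s and β₂’s for the 26 men are given in Table 13, together
with their individual variances and covariances. The variances and covariances
between α̂₂’s and β̂₂’s, calculated from values in Table 13, are

$$\operatorname{var}(\hat\alpha_2) = 0.004759$$
$$\operatorname{var}(\hat\beta_2) = 0.0^{6}1040/36 = 0.0^{8}2889$$
$$\operatorname{cov}(\hat\alpha_2, \hat\beta_2) = -0.0^{4}1168/6 = -0.0^{5}195$$

Using these numbers, estimates of var (α₂), etc., may be found because it is
approximately true that

$$\operatorname{var}(\alpha_2) + \operatorname{mean}\{\operatorname{var}(\hat\alpha_2 \mid \alpha_2)\} = \operatorname{var}(\hat\alpha_2), \text{ etc.}$$

From the Table:

$$\operatorname{mean}\,\operatorname{var}(\hat\alpha_2 \mid \alpha_2) = 0.004515$$
$$\operatorname{mean}\,\operatorname{var}(\hat\beta_2 \mid \beta_2) = 0.0^{7}5187/36 = 0.0^{8}1441$$
$$\operatorname{mean}\,\operatorname{cov}(\hat\alpha_2, \hat\beta_2 \mid \alpha_2, \beta_2) = -0.0^{4}1239/6 = -0.0^{5}206$$

and finally

$$\operatorname{var}(\alpha_2) = 0.0^{3}244$$
$$\operatorname{var}(\beta_2) = 0.0^{7}5213/36$$
$$\operatorname{cov}(\alpha_2, \beta_2) = 0.0^{6}71/6$$
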

Table 13. Estimates of parameters of straight line fitted to O, W and variances and covariances

Subject
No.     α        β          var (α)    var (β)     -cov (α, β)
7       0.309    0.00255    0.00191    0.0⁷229     0.0⁵505
8       0.284    0.00255    0.00135    0.0⁷190     0.0⁵388
9       0.417    0.00185    0.00483    0.0⁷682     0.0⁴139
10      0.371    0.00194    0.00448    0.0⁷613     0.0⁴127
11      0.301    0.00268    0.00507    0.0⁷759     0.0⁴150
12      0.247    0.00276    0.00219    0.0⁷314     0.0⁵634
13      0.401    0.00186    0.00143    0.0⁷134     0.0⁵335
14      0.305    0.00264    0.00520    0.0⁷810     0.0⁴157
15      0.258    0.00234    0.00293    0.0⁷458     0.0⁵886
16      0.463    0.00179    0.01636    0.0⁶188     0.0⁴424
17      0.320    0.00275    0.00678    0.0⁷700     0.0⁴167
18      0.436    0.00229    0.01345    0.0⁶192     0.0⁴389
19      0.282    0.00234    0.00014    0.0⁸183     0.0⁶465
20      0.375    0.00255    0.00443    0.0⁷493     0.0⁴113
21      0.287    0.00251    0.00214    0.0⁷350     0.0⁵663
22      0.287    0.00243    0.00020    0.0⁸309     0.0⁶598
23      0.323    0.00271    0.00007    0.0⁹837     0.0⁶188
24      0.24     0.00227    0.00406    0.0⁷589     0.0⁴118
25      0.373    0.00173    0.01198    0.0⁶181     0.0⁴356
26      0.330    0.00230    0.00139    0.0⁷186     0.0⁵389
27      0.536    0.00224    0.00894    0.0⁷750     0.0⁴198
28      0.330    0.00227    0.00456    0.0⁷527     0.0⁴118
29      0.323    0.00208    0.00386    0.0⁷531     0.0⁴110
30      0.336    0.00201    0.00640    0.0⁷797     0.0⁴173
31      0.350    0.00185    0.00257    0.0⁷353     0.0⁵729
32      0.309    0.00250    0.00065    0.0⁸832     0.0⁵178


This means that at a W of 900 ft lb/min, for example, the variance of true
O is 0.0³244 + (900)² × 0.0⁷5213/36 + 1800 × 0.0⁶71/6 = 0.001630. This value is
smaller than the residual variances of O due to day-to-day variations.
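
The arithmetic of this example can be reproduced as below. The sketch assumes the variance components read as var (a) = 0.000244, var (b) = 5.213 × 10⁻⁸/36 and cov (a, b) = 7.1 × 10⁻⁷/6, a reading of the appendix figures that is consistent with the quoted total of 0.001630; the function name is hypothetical.

```python
def variance_of_true_o(work_rate, var_a, var_b, cov_ab):
    """Between-subject variance of true O at work rate W for O = a + b * W:
    var(a) + W^2 var(b) + 2 W cov(a, b)."""
    return var_a + work_rate ** 2 * var_b + 2.0 * work_rate * cov_ab


# Variance components as read from the appendix (an assumed reading).
VAR_A = 0.000244
VAR_B = 5.213e-8 / 36
COV_AB = 7.1e-7 / 6

variance_at_900 = variance_of_true_o(900.0, VAR_A, VAR_B, COV_AB)
print(f"{variance_at_900:.6f}")
```

The slope term dominates at this work rate, contributing roughly 0.0012 of the total.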


3. The Effect of Measuring V (Respiratory Ventilation) on the Precision of
Estimating O from W
The effect of V on precision of estimating O from W was examined as
follows:
For given W, true O and true V, or true O and V (estimated by using
measurements of V), are, from man to man, random variables, and unless they
are independent, it may be expected that

$$\operatorname{var}(O \mid W, V) < \operatorname{var}(O \mid W)$$

so that making measurements of V should enable one to set narrower limits to
the individual maximum oxygen intake.

Values of a₃ and β₃ in the V = a₃ + β₃W relationship were calculated and,
using var (â₃ | a₃) and var (â₃), etc., as above, the correlation between O and V
was estimated. For W’s of 400 ft lb/min and above this correlation was about
0.4 and, assuming that the order of the reduction in var (O|W, V) below var
(O|W) is similar to the occurrence in normal distribution theory (where
var (O|W, V) = var (O|W)(1 − r²) and r is the correlation coefficient), s.d.
(O|W, V) = 0.9 s.d. (O|W).
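
As a worked check of the last step, taking the quoted correlation r = 0.4 and the normal-theory relation above:

```latex
\frac{\text{s.d.}(O \mid W, V)}{\text{s.d.}(O \mid W)}
  = \sqrt{1 - r^{2}}
  = \sqrt{1 - 0.4^{2}}
  = \sqrt{0.84}
  \approx 0.92
```

That is, measuring V would narrow the standard deviation by less than 10 per cent, which is the basis for the conclusion that the extra measurement is not warranted.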


4. Calculation of ‘ Probability’ Limits of Maximum O 

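For calculation of the limits, the O’s (and H_m) are most conveniently
regarded as random variables with variances calculated (in the case of O)
using var (α₂), var (β₂), cov (α₂, β₂). The observed variance of H_m between the
six subjects (Table 9) is used.

The actual procedure selected for determining O_max from the four pairs
(O₁, h₁) ... (O₄, h₄) is

$$O_m = \gamma + \delta H_m$$

where

$$\delta = \frac{(O_1 + O_2) - (O_3 + O_4)}{(h_1 + h_2) - (h_3 + h_4)}, \qquad \gamma = \bar{O} - \delta \bar{h},$$

$\bar{O}$ and $\bar{h}$ being the means of the four O’s and h’s respectively.
This choice of O_m was dictated largely by considerations of simplicity because
it is anticipated that relatively unskilled personnel will be using the method on
mines.

The variance of O_m is given approximately by

$$\operatorname{var}(O_m) = u' M u$$

where $u' = (1,\ H_m,\ \delta)$ and

$$M = \begin{pmatrix}
\operatorname{var}(\gamma) & \operatorname{cov}(\gamma, \delta) & \operatorname{cov}(\gamma, H_m) \\
\operatorname{cov}(\gamma, \delta) & \operatorname{var}(\delta) & \operatorname{cov}(\delta, H_m) \\
\operatorname{cov}(\gamma, H_m) & \operatorname{cov}(\delta, H_m) & \operatorname{var}(H_m)
\end{pmatrix}$$

The following notation was used in calculating the elements of M:

$$x_1 = (O_1 + O_2) - (O_3 + O_4), \quad E(x_1) = \xi_1$$
$$x_2 = (h_1 + h_2) - (h_3 + h_4), \quad E(x_2) = \xi_2$$
$$f_1 = \bar{O}, \quad f_2 = \bar{h}; \quad E(f_1) = \phi_1, \quad E(f_2) = \phi_2$$
$$\operatorname{var}(x_1)/\xi_1^2 = c_{1\cdot}, \quad \operatorname{var}(x_2)/\xi_2^2 = c_{2\cdot}, \quad \operatorname{var}(f_1)/\phi_1^2 = c_{\cdot 1}, \quad \operatorname{var}(f_2)/\phi_2^2 = c_{\cdot 2},$$
$$\operatorname{cov}(x_1, f_1)/\xi_1\phi_1 = c_{11}, \quad \operatorname{cov}(x_2, f_2)/\xi_2\phi_2 = c_{22},$$

and the results derived were:

$$\operatorname{var}(\gamma) = \phi_1^2 c_{\cdot 1} + \delta^2\phi_2^2\,(c_{1\cdot} + c_{2\cdot} + c_{\cdot 2} - 2c_{22}) - 2\delta\phi_1\phi_2 c_{11}$$
$$\operatorname{var}(\delta) = \delta^2 (c_{1\cdot} + c_{2\cdot})$$
$$\operatorname{cov}(\gamma, \delta) = \phi_1 \delta c_{11} - \phi_2 \delta^2 (c_{1\cdot} + c_{2\cdot} - c_{22})$$
$$\operatorname{cov}(\gamma, H_m) = \operatorname{cov}(f_1, H_m) - (\phi_2/\xi_2)\operatorname{cov}(x_1, H_m)$$
$$\operatorname{cov}(\delta, H_m) = \operatorname{cov}(x_1, H_m)/\xi_2$$

The calculation of var (x₁), var (f₁), etc., is fairly simple. Writing
$W = (W_1 + W_2) - (W_3 + W_4)$ and $\bar{w}$ for the mean of the $W_i$’s we have

$$\operatorname{var}(x_1) = W^2 \operatorname{var}(b_2)$$
$$\operatorname{var}(f_1) = \operatorname{var}(a_2) + \bar{w}^2 \operatorname{var}(b_2) + 2\bar{w}\operatorname{cov}(a_2, b_2)$$
$$\operatorname{cov}(x_1, f_1) = W \operatorname{cov}(a_2, b_2) + W\bar{w} \operatorname{var}(b_2)$$
$$\operatorname{var}(x_2) = 4\sigma_h^2$$
$$\operatorname{var}(f_2) = \tfrac{1}{4}\sigma_h^2$$
$$\operatorname{cov}(x_2, f_2) = 0$$
$$\operatorname{cov}(x_1, H_m) = W \operatorname{cov}(H_m, b_2)$$
$$\operatorname{cov}(f_1, H_m) = \operatorname{cov}(H_m, a_2) + \bar{w}\operatorname{cov}(H_m, b_2)$$

As explained above, observed var (H_m) was used, and the same applied to
cov (H_m, a₂) and cov (H_m, b₂). These estimates are very rough but it was, in
fact, established that they do not affect the result very much. The main
contribution to var (O_m) is due to var ([illegible]).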


5. Error in Estimating Maximum O by Proposed Procedures

Variance of maximum O = [equation illegible in the source; the surviving
terms are (180 − H₂), (O₂ − O₁), var (H₁) and (H₂ − H₁)]

where H₁ and O₁ are the assumed common point, viz., resting values, and H₂
and O₂ are H and O values at work.
The ‘between subject’ variance in resting H can be determined from
residual variances on the 26 men and the four trained subjects, and is
(4²)(10) − 40 = 120
Hence, taking hypothetical values of
H₁ = 80 and O₁ = 0.3 l./min
H₂ = 140 and O₂ = 2.1 l./min
then
maximum O = 3.3 l./min
s.d. = 0.25
and coefficient of variation = 8 per cent

This figure is of the same order as the s.d. of maximum O given in a preceding
section. The final s.d. from two observations may, however, be expected to
be larger than that from four observations, and can be calculated as follows:

Var (O_m) = [equation illegible in the source]

where H₁ is the assumed common resting H, and

$$O_i = A + BW_i$$
$$\operatorname{var}(O_1) = \operatorname{var}(a) + 2W_1 \operatorname{cov}(a, b) + W_1^2 \operatorname{var}(b)$$
$$\operatorname{var}(O_2) = \operatorname{var}(a) + 2W_2 \operatorname{cov}(a, b) + W_2^2 \operatorname{var}(b)$$
$$\operatorname{cov}(O_1, O_2) = \operatorname{var}(a) + (W_1 + W_2)\operatorname{cov}(a, b) + W_1 W_2 \operatorname{var}(b)$$

as before.

Applying this procedure to one of the 26 subjects, No. 28, and using the
variance of H₁ of 120 (beats/min)², we then have
var O_m = 0.53, s.d. 0.73 (Astrand and Rhyming method)
        = 0.26, s.d. 0.51 (present method)


THE ESTIMATION OF BODY MEASUREMENTS OF BRITISH 
POPULATION IN RELATION TO SEAT DESIGN 


By D. Barkla


Furniture Development Council, Research Department, 2 Dalmeny Avenue, London, N.7. 


Reports of surveys of human measurements are not always easily accessible,
and their information about dimensions relevant to seat design is often mixed up 
with other matter. This paper brings together the principal published information 
on relevant dimensions. Most of the populations surveyed are not themselves 
important to British designers, but estimates of the measurements of young British 
adults are derived from the published material. 


§ 1. INTRODUCTION


There are many different opinions about how chairs should be designed.
One thing, however, seems to be generally agreed: that the dimensions of the
chair should be in reasonable agreement with the dimensions of the person
sitting in it. It is also widely held that he should be able to change position
easily. Good fit alone may not ensure comfort, but bad fit will almost certainly 
ensure discomfort. 

Unfortunately, human body measurements that designers could use have 
often been contained in surveys and reports which are not easily accessible. 
This paper summarizes a number of these. Few were British or were intended 
to provide data for designers of seats, but in this paper estimates of British 
measurements have been derived from them. The assumptions underlying 
the derivation are pointed out. Nearly all the data, and hence the estimates, 
refer to adults under the age of 40. 


§ 2. AVAILABLE MEASUREMENTS OF SELECTED DIMENSIONS


The published dimensions considered important to seat designers are set
out in Table 1. They are usually defined as follows:

Subjects are measured naked, and, except for stature and shoulder width,
are seated on a horizontal table with trunk erect and looking directly forward,
upper arms vertical and forearms horizontal, underpart of thighs just touching
the table, knees together, legs vertical, and feet on a horizontal rest.

Stature: subjects standing erect looking directly forward; from floor to
vertex, hair pressed down.

Shoulder width: standing erect with arms at sides and palms forward;
distance between outer aspects of shoulders, across the middle deltoid muscles.

Sitting height: from seat to vertex, hair pressed down.

Shoulder-seat: from seat to top of shoulder (highest bony point) or from
seat to top of breast-bone (sternum). The two dimensions have practically
identical values.

Lumbar height: from seat to maximum inward deflection of the back.

Elbow-seat: from seat to bottom of elbow.

Buttock–back of calf: from back of buttocks to back of calf at its maximum
diameter
Table 1. Means and standard deviations of selected dimensions collected from various sources
Source Sex Nationality Number in Age Stature Weight ae aera Bs 
number sample 2 D 
Sub- Total 
sample sample worm oorw pb a be a be fo] bb o p 
i 5 39°2  D-A§ 53° 758 36° 2 23°8 0-93 
land2 Male American 2960 69-2 2-43 153:1 1758 36-4 1:20 
il 45 584 67:9 (2:45) 147-1 (15°81) 35-9 (1:19) 23:3 (0-96) 
2 . ry 103) 34-1 Go 5-98 69:0 2-28 167-1 27-04 36:6 1:06 Z3°7  t-06 
2 i os 271 38-9 Go 8:13 68:3 2-41 166:9 27-93 36:2 1:16 23°7 1:00 
3 Bs 53 25000 m™22 wib5b-40 68°56 2:51 154:8 20°49 35:8 1:34 23:0 1-17 
5 33 ” 4063 27-9 o 4:22 69:1 2-44 163°7 20:86 35:9 1:29 23:3 ii4 9:2 
5 33 93 1T84 228-9 o 3:61 69-4 2:3 165:7 20-5 36:1 1:24 23-4 1:09 9-3 
4) 2” By 774 26-9 ao 214 69:5 2:22 164:0 20°51 36°1 1-27 23°4 93 
5 3 >» 398 132-8 o 2:52 69-4 2-46 167°8 20-31 36:1 1-26 23-4 9-3 
5 % 2 81 436-5 o 3°33 69:1 2°20 168:4 20°57 36°0 1:15  23°5 9-2 
5 59 a 94 32-2 a317 69:6 2-46 17s'2 1977 36:2 1:36 
6 ” 3 1959 38:5 wl7-89 69:0 (2°89) 166-7 (25:1) 
7 és Swedish 295 69-3 
7 5 % 274 
7 2 %9 248 
8 and 9 a5 British 2241 p21-2 wi8-38 684 2:47 139°6 16:09 35:7 1:26 
9 ” % 529 23-2 Go 3:26 68:5 2:37 144:9 15:90 362 1°24 
10 93 o3 361 m24 72-3 150 3% 1-01 
10 cr) 4 1844. m24 67-9 2°39 S675 1°97 
13 as of various 23:0 o@ 1:52 68:1 2:29 148-3. 16:85 ~35:4 1:33 21:6 1-06 
1 Female American 447 64-9 (1°75) 128-6 (12-4) 34°1 (1-02) 
al > =) 152 63°5 (2°33). 121°9 (12:4) 33°7 (1:19) 
4 35 aA 8500 = m25 wi8-50 63:9 2-41 131-9 19-96 Ze) OS 
11 sh oi 10042 = m32 63:2 2°48 133-5 25-98 : 
6 ” ” 1908 35:5 w17-84 64:9 (2°51) 133-1 (21-5) 
ii “5 Swedish 363 64-2 
a ” 33 251 
7 % % 59 ] 
many = 
12 55 British 2366 wi18-29 63:4 2:48 126°6 18-37 
12 33 ry) 1554 w30-44 63:1 2°63 134-6 22-56 
12 35 1075 w45-64 62:1 2:55 1438°8. 25°56 
12 i 4995 wil8-64 63-0 2:59 132:8 22-45 


Weight is tabulated in pounds, other dimensions in inches.
μ = mean or median (dimensions), m = median (age of subjects), σ = standard deviation, w = range.
(Values bracketed in […] have been estimated for this paper.)

The following notes refer to each numbered source in Table 1, and show as far as
possible how samples were constituted, and whether the definitions of dimensions differed
from those given above.

(1) Randall et al. (1946). Samples: males between 60 in. and 76 in. tall, and weighing
between 120 lb and 200 lb (2960 Aviation Cadets, 584 gunners). Females between […]
specified limits of height and weight (447 pilots, 152 Air Force nurses).
Measurements: subjects sat with legs dangling, not on a foot-rest. Elbow […]
measured with arms at sides of thighs.

(2) MacFarland et al. (1953). Samples: 103 champion lorry drivers, 271 regular lorry […]
mostly from Texas or New England. Measurements: as for source 1 (but subjects […]

(3) Newman and White (1951), Randall and Baer (1951). Sample: about 25 000
U.S. Army men. Knee height from […] to patella, not to top of knee.

(4) Randall and Munro (1949) and Randall (1947). Sample: about 8500 U.S. Army […]
Measurements: subjects sat with feet on a non-adjustable rest. Knee height from […]

(5) Fry and Churchill (1956) and Hertzberg et al. (1954). Samples: (i) all available
personnel at 14 Air Force base (4063 men); (ii) 94 Strategic Air Command pilots.
Sub-groups of sample (i): All bomber pilots: under 30, 774 men; over 30, […]
over 34, 81 men; total, 1184 men. Knee height from base of heel to top of knee.

(6) Hooton (1945). Samples: 1959 men and 1908 women volunteers over 17, at […]
and Chicago railway stations. Measurements: subjects clothed and shod, sitting […]
adjustable chair with seat tilted at 7° from horizontal. Elbow-seat: ‘from arm […]
seat’ (they were not parallel). Buttock–back of calf: from back of chair to back […]
Underside of thigh–floor: seat height at back of chair, plus corrections for slope of […]
compression of thigh when it projected beyond the front edge of the seat, and for […]
who said they took shoes off in trains. Seat width: maximum spread across upper […]
or buttocks.
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mensions collected from various sources 
— Elbow- Buttoek— Buttock— Knee height Underside of Shoulder Elbowwidth Buttock 
seat knee back of calf thigh — floor width width 
bb o be o iu o Bb o jai o be o be o be fo) 
8 23-6 1-06 22-0 0:94 18:0 O77 16:7 099 14:0 0°75 
4) 23:1 (1-04) 21:5 (0-96) 17°7 (0°74) 16:4 (1:11) 18-8 (0-82) 
6 23-7 1-09 21-7 1-03 18-7 1°05 17°3' 1:67 «814:7 1:03 
0 Zoe U-t2 21:6 0:99 18:3. 0:97 17:9 1:68 145 1:08 
1 23°4 1-12 21:6 1-09 17-9 0:99 17-5 1:54 14:0 0-90 
9 O92 1:04 23°6 1-06 21-7 0-99 17-0 OTT. 17:9) 0:91 17-3 1:42 14:0 0:87 
8 9-20 91-02, 23-7 1-03 21-8 0-96 aloo O74 18:0 O89 17:55 1:41 14:1 0:87 
9-2 23-7 0:99 21:8 Liga 18:0 0:86 17:3 1:37 14:0 0:87 
9-2 23°8 1-05 21°7 17-0 18:1 0:91 17°8 1:44 14-2 0-87 
9-3 23°7 1-01 21°6 16-9 18-1 17:8 1:49 14:2 0:84 
23°7 1:01 14-8 0-86 
9-6 (0°89) 18-9 (0-96) 19:0 (0°89) 17:6 (0:67) 15°3 (1°11) 
(17-2) (0-81) 
(19-3) (0-95) 
(21-7) (1-05) 
22°8 1-25 
23-3 1-03 168 0-69 
10-9 118 24-4 0-81 223 1:03 15:3 0-42 17:0 0-63 18:0 0-77 15-4 0-70 
105 0-94 23-4 1-07 20:8 1:12 15-3 0-53 165 0-68 17:7 0-83 15:3 0:69 
8-7 1:04 22:3 1-00 17°9 0-90 19-9 0-99 18:5 0:84 13-9 0:84 
8) 22-6 (0-96) 20-1 (0-88) 16-1 (0°88) 15-1 (1-17) 15-0 (1-03) 
1) 22-4 (0-88) 19-5 (0°58) 15-7 (0°58) 14-9 (0-88) 15-1 (0°89) 
3 18:3 0:98 
6 
9-7 (0°89) 18:2 (1-04) 18:1 (0°89) 15-7 (1-04) 14-6 (1:04) 


(15-7) (0°81) 
(18-7) (0-97) 
(20-5) (0-81) 


Akerblom (1948). Sample (or samples?): ‘non-selected’ Swedish factory and office
workers. Measurements: subjects clothed but without shoes, sitting, presumably on their
own chairs. Buttock-back of calf: from back of chair to back of calf.

Morant (1943) and Morant and Gilson (1945). Sample: volunteers for R.A.F. flying
duties, mostly from southern England. Men less than 60 in. tall excluded. Measurements:
subjects sitting on a form with buttocks and shoulders against a wall.

Morant and Gilson (1945). Sample: R.A.F. aircrew, including 185 pilots (not less than
… in. tall) and 14 gunners (not more than 72 in. tall). Measurements: subjects sitting on a
stool adjusted so that tops of thighs were horizontal, and with backs against a measuring
board. Shoulder width: with arms folded.

Hill and Morant (1945). Samples: Royal Armoured Corps tank personnel: 361 Guards
(not less than 69 in. tall); 1844 from other units. Measurements: subjects clothed but
without boots or tunics, sitting with backs against a measuring board. Elbow-seat: with
arms pressed back against board. Underside of thigh-floor: the height of the seat. Elbow
width: distance between outer surfaces of forearms, about 2 in. in front of elbows. Seat
width: distance between outer surfaces of thighs just below greater trochanters, with knees


O’Brien and Shelton (1941). Sample: volunteers from various women’s organizations;
housewives and country women inadequately represented. Measurements: Shoulder-elbow:
with hand on hip, from point of elbow to intersection of armscye and shoulder lines.

Kemsley (1957). Sample: volunteers from firms and various women’s organizations;
working women and Londoners over-represented. Sub-groups: age groups 18-29, 30-44,
45-64. Measurements: subjects wearing their usual foundation garments.

Geoghegan et al. (1957) and Roberts (1957). Sample: non-commissioned naval and
Royal Marines personnel at Portsmouth. Measurements: stature and weight, measured
directly. Subjects then photographed and measurements later read from various sub-samples
(containing between 61 and 287) of the photographic records. Sitting height was
measured directly on another sample of 63. Buttock-knee and buttock-back of calf: with
axis horizontal, measured from lumbar plane, not back of buttock. Shoulder width:
arms hanging. Knee height: from floor to upper margin of patella. (Note also thigh
width, maximum: μ = 15.9 in., σ = 1.10 in.).
Underside of thigh-floor: from foot-rest to underside of thigh just behind
the knee.

Elbow width: distance between outer aspects of elbows.

Buttock width: distance between bony protuberances at upper ends of
thighs (the greater trochanters).

Shoulder-elbow: from top of shoulder (highest bony point) to bottom of
elbow.

Buttock-knee: from back of buttock to foremost point on patella.

Knee height: from foot-rest to top of muscle mass near lower end of femur.

Waist height: from seat to greatest lateral abdominal indentation, or where
the belt is worn.


§ 3. ESTIMATING MEASUREMENTS OF THE BRITISH POPULATION


Table 1 contains very little about people over 40 years old. Older people
tend to be shorter, but that cannot be described quantitatively. The estimates
given in this section refer to people between 18 and 40 years of age. Within
these limits it is possible to use Table 1 to provide an estimate of the means
and standard deviations of British measurements for each dimension. The
main difficulties are that most of the available measurements are American,
that most of the measurements are of service men and women and not, therefore,
representative of the whole population, and that the definitions of some
dimensions differ from one source to another.


(a) Estimates of means for the male population 


Table 2 shows mean measurements expressed as proportions of stature.
In the evidence for males, sitting height, buttock-knee length, and knee
height, the only really comparable measurements, show good agreement
between British and American values. Several other measurements look to
be in reasonable accord in view of stated differences in measuring technique.
It is assumed, therefore, that national differences in mean bodily proportions
may be neglected here.


Table 3. ‘Best’ estimates of body dimensions as proportions of stature (rough averages from
Table 2, allowing for differences of technique: see Key to Table 1)


Males Females

Sitting height 0.525 0.525
Shoulder - seat 0.340 0.335
Waist height 0.135

Shoulder - elbow 0.210 0.210
Elbow - seat 0.130 0.125
Buttock - knee 0.342 0.350
Buttock - back of calf 0.280 0.290
Knee height 0.315 0.310
Underside of thigh - floor 0.245 0.245
Shoulder width 0.260 0.250
Elbow width 0.255 0.250
Buttock width 0.205 0.235


Although very small men are excluded from the services, the mean proportions
of servicemen may be used for the whole population, since Table 2
indicates that moderate differences in stature do not lead to substantial
differences in proportion (vide source 10, and, less strikingly, source 1).
If there is a reliable measure of at least one dimension in the whole British
population, proportions from other samples can be used to provide absolute
values for the British population. Morant (1947) states that the mean stature
for British adult males is 67.5 in. Clements and Pickett (1957) say it is
67.05 in. Both those values raise problems, since they differ sharply from the
values of 68 in. or more obtained from servicemen. Perhaps servicemen are
much more highly selected for stature than they are supposed to be; or perhaps
military training increases apparent stature, by altering posture. If training
does increase apparent stature, it may be inappropriate to ‘scale down’ all
other servicemen’s measurements to suit civilian stature; training seems
unlikely to alter all measurements in the same way and to the same degree.
However, from the data it can only be assumed that the civilian mean stature
is really between 67.05 in. and 67.5 in.; say 67.25 in.

The estimates of means of other dimensions for British adult males are
given in Table 4. They are obtained by multiplying 67.25 in. by the ‘best’
estimate of each dimension as a proportion of stature (Table 3).
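
The multiplication just described can be sketched in a few lines of Python. The proportions are copied from Table 3 and the stature from the text; the names `estimate_mean` and `round_to_quarter` are illustrative assumptions, not part of the original procedure. Because the result is rounded to the nearest 0.25 in., an individual value may differ from the printed Table 4 by one rounding step.

```python
# Proportions of stature for males, copied from Table 3 (a subset only).
MALE_PROPORTIONS = {
    "sitting height": 0.525,
    "elbow-seat": 0.130,
    "underside of thigh-floor": 0.245,
    "buttock width": 0.205,
}

MALE_STATURE_IN = 67.25  # civilian mean stature assumed in the text, inches


def round_to_quarter(value):
    """Round to the nearest 0.25 in., the precision quoted for Table 4."""
    return round(value * 4) / 4


def estimate_mean(proportion, stature=MALE_STATURE_IN):
    """Estimate a mean dimension as proportion x mean stature."""
    return round_to_quarter(proportion * stature)


if __name__ == "__main__":
    for name, proportion in MALE_PROPORTIONS.items():
        print(f"{name}: {estimate_mean(proportion):.2f} in.")
```

The same function serves for the female estimates if the female proportion and a female mean stature are passed in.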


Table 4. Estimated measurements, in inches, of the British population between 18 and 40


                                         Men                             Women
Dimension                      μ       σ    90 per cent        μ       σ    90 per cent
                                            range                           range

Stature                      67.25    2.6   63.00-71.50      63.25    2.6   59.00-67.50
Sitting height               35.25    1.4   33.00-37.50      33.25    1.4   31.00-35.50
Shoulder - seat              23.00    1.2   21.00-25.00      21.25    1.2   19.25-23.25
Lumbar height               (8-12)*
Elbow - seat                  8.75    1.1    7.00-10.50       8.00    1.1    6.25-9.75
Buttock - back of calf       18.75    1.0   17.00-20.50      18.25    1.0   16.50-20.00
Underside of thigh - floor   16.50    0.8   15.25-17.75      15.50    0.8   14.25-16.75
Shoulder width               17.50    1.0   15.75-19.25      15.75    1.0   14.00-17.50
Elbow width                  17.25    1.6   14.75-19.75      15.75    1.6   13.25-18.25
Buttock width                13.75    0.9   12.25-15.25      14.75    0.9   13.25-16.25


μ = mean (to nearest 0.25 in.), σ = standard deviation (to nearest 0.1 in.), 90 per cent range = 5th
and 95th centiles (to nearest 0.25 in.).


* Darcus and Weddell (1947): a sample of sailors; mean stature not given.
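
As a rough check on how a 90 per cent range relates to a mean and standard deviation, the following Python sketch computes 5th and 95th centiles under the assumption of a normal distribution. That assumption, and the function names, are illustrative and are not stated in the paper; the printed ranges were evidently rounded by hand, so not every row need be reproduced exactly.

```python
from statistics import NormalDist

# z-value of the 95th centile of a standard normal distribution (about 1.645).
Z_95 = NormalDist().inv_cdf(0.95)


def round_to_quarter(value):
    """Round to the nearest 0.25 in., as in the key to Table 4."""
    return round(value * 4) / 4


def ninety_per_cent_range(mean, sd):
    """Return (5th centile, 95th centile), assuming a normal distribution."""
    half_width = Z_95 * sd
    return round_to_quarter(mean - half_width), round_to_quarter(mean + half_width)


if __name__ == "__main__":
    # Male stature from Table 4: mean 67.25 in., standard deviation 2.6 in.
    print(ninety_per_cent_range(67.25, 2.6))
```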


(b) Estimates of means for the female population 

In estimating means for the unselected British adult female population
from the available material, the same assumptions have to be made as for
males. Kemsley (1957) states that the mean stature for British females aged
18-44 is 63.3 in. This agrees well with a value for American civilian women
but not with values for American service women.

The estimates of means for British adult females are given in Table 4. In
arriving at the ‘best’ estimates of proportions (Table 3) it is assumed, because
the data for females are so scanty, that corresponding sources for males and
females are similarly biassed. The assumption is supported by the values in
Table 2 for shoulder-elbow, which are higher in source 1 than in source …
(males) and the corresponding source 4 (females).
(c) Estimates of standard deviations for the male population


The standard deviations given in Table 1 agree fairly well among themselves,
but there are several possible sources of disagreement; the most
important is variation among the populations (in general, the various services)
from which the samples are drawn. Service groups are so selected as to be
more homogeneous than the parent population (that is, they yield reduced
standard deviations of measurements), and they may also be so selected, or
trained, as to yield different mean measurements. Inspection of Table 1 and
the notes to it shows that the effects of selection are considerable. The least
selected groups (sources 3 and 5) have the greatest standard deviations, and
are used to provide the best estimate wherever possible.

To compare the variabilities of samples with diverse means, the coefficient
of variation, (standard deviation x 100)/mean, is used. There is fair
concordance between the samples except where the group is highly selected
or where there are clear differences of measuring technique.
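
A minimal Python sketch of the coefficient of variation as defined above; the function name is an illustrative assumption. The example uses the estimated stature of British men (mean 67.25 in., standard deviation 2.6 in.).

```python
def coefficient_of_variation(sd, mean):
    """Coefficient of variation in per cent: (standard deviation x 100) / mean."""
    return sd * 100 / mean


if __name__ == "__main__":
    # Stature of British men: mean 67.25 in., standard deviation 2.6 in.
    print(round(coefficient_of_variation(2.6, 67.25), 1))
```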


Table 5. Coefficients of variation: males 


Clements 
Source 1 3 % 6 “ 10 10 13 and. 
n= i i Pickett 
2960 361 1844 1957 
Stature 3°5 3°7 3-5 |Subjects| 3-6? 21 3°5 3-4 3°9 
wearing 
shoes 
Sitting height 3:3 3°7 3°6 2-7 3°2 3°8 
Shoulder — seat 3:9 5-1 4-9 4-9 
Shoulder-elbow 4-6 5:7 4-8
Elbow-seat 11°4 9°3 10:8 9-0 | 12:0
Buttock — knee 4-5 4-8 4-5 3°3 4-6 4:5 
Buttock — back of calf 5:1 4-9 5:0 
Knee height 4:3 5:0 4-6 4-8 4-6 5:4 5:0 
Underside of thigh-floor 4-5 |Subjects| 4:7 2-7 370
wearing 
; shoes 
Shoulder width 4-3 5-5 


3°8 37 4-1 4°5 


5-1 
Elbow width 5:9 8:8 8-2
3 6-2 


Buttock width 7:3


There are far greater differences in coefficients of variation between
dimensions than between samples. This might mean that some dimensions
really are more variable than others, but it might also mean merely that some
dimensions are harder to measure than others. Most of the values in Table 5
are reasonably concordant and reasonably low, but the coefficients of variation
of elbow-seat height and elbow width should certainly be regarded with
suspicion.

The estimation of standard deviations of each dimension for the British
population is not, in principle, easy. However, since the estimate needs to be
accurate only to 0.1 in. (and since the available material may itself be less
precise than it seems) the estimate for British males is derived from a sample
standard deviation by multiplying by

    British standard deviation of stature (Clements and Pickett, 1957)
    ------------------------------------------------------------------
       sample standard deviation of stature (source 3 or source 5)
This procedure is not possible for the dimension buttock-back of calf, which is
given in source 7, but not in source 3 or source 5. Since, however, source 7
agrees well with source 5, it seems reasonable to guess at its standard deviation
of stature.

The estimates based on this reasoning are given in Table 4. 
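
The scaling of a sample standard deviation by the ratio of stature standard deviations can be sketched as follows. The function and parameter names are illustrative assumptions, and the sample figures in the example (1.3 in. and 2.4 in.) are hypothetical values chosen only to show the arithmetic; they are not taken from Table 1. The British standard deviation of stature, 2.6 in., is the value used in Table 4.

```python
def scale_sd(sample_sd, british_stature_sd, sample_stature_sd):
    """Scale a sample standard deviation to the British population.

    The sample value is multiplied by the ratio of the British standard
    deviation of stature to the sample standard deviation of stature.
    """
    return sample_sd * british_stature_sd / sample_stature_sd


if __name__ == "__main__":
    # Hypothetical sample: dimension sd 1.3 in., stature sd 2.4 in.
    estimate = scale_sd(sample_sd=1.3, british_stature_sd=2.6, sample_stature_sd=2.4)
    print(round(estimate, 1))  # rounded to 0.1 in., the accuracy the text asks for
```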


Table 6. Coefficients of variation: females


Source 4 11 12
Stature 3.8 3.9 4.0
Shoulder - seat 5.1
Shoulder - elbow 5.6 5.7


(d) Estimates of standard deviations for the female population 


The available material is very defective. One group in source 1 had very
few members and the other was highly selected for stature. The only reliable
evidence is summed up in Table 6. The standard deviation for stature is the
same for British females (source 12) as it is for British males (Clements and
Pickett 1957). The only estimate of a British standard deviation that can
be derived from the available material, shoulder-seat height (from source 4),
is also identical with the value for British males. In view of this, it is tentatively
assumed that each standard deviation for females is identical with the
corresponding one for males (see Table 4).


§ 4. APPLICATION OF THE ESTIMATES TO SEAT DESIGN


The application of anthropometric data to design should be indirect as a
rule; that is, the data should be used to specify the body sizes of people who test
the equipment, rather than to obviate testing altogether. Natural postures
are usually very different from the rigid ones most convenient for anthropometric
surveys. Thus, there are several dimensions of chairs that are not
directly governed by the dimensions specified in this paper: for instance,
the minimum acceptable height and depth of the seat, the shape of arm-rests,
and the angular measurements. However, there are
several other dimensions of chairs that depend directly on measurements
quoted in this paper, for instance the maximum acceptable height and depth
of the seat, and its minimum acceptable width; but it must be noted that
buttock width is less than minimum thigh breadth. The measurements
chosen for these dimensions will depend on what part of the population the
chair is meant for.

Any application of the estimates in Table 4 must take into account that
they refer to naked people. It is difficult to estimate the effects of clothes
and shoes on measurements, particularly for women, but values are given by
Roberts (1945) for American servicemen’s clothing. The increases in measurements
are:


Shoulder width oe in indoor clothes 0-4in. with overcoat 
Elbow width     0.6 in. in indoor clothes     1.8 in. with overcoat
Buttock width   0.6 in. in indoor clothes     1.1 in. with overcoat
Shoes cause an increase in the measurement of underside of thigh-floor.
For men’s shoes the mean increase is 1.1 in., and for women’s shoes there are at
present (1960) two modes, one at about 0.9 in. and one at 2.0 in. Slippers,
both men’s and women’s, cause an increase of about 0.7 in.
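
One way to apply these allowances is to add the mean increase to the naked estimate. The Python sketch below does this for the elbow-width, buttock-width and men's shoe figures quoted in the text; the dictionary layout and function name are illustrative assumptions, and simple addition of a mean increase is itself an assumption rather than a procedure stated in the paper.

```python
# Mean increases in inches quoted in the text for clothing and men's shoes.
CLOTHING_ALLOWANCE_IN = {
    "elbow width": {"indoor": 0.6, "overcoat": 1.8},
    "buttock width": {"indoor": 0.6, "overcoat": 1.1},
}
MENS_SHOE_ALLOWANCE_IN = 1.1


def clothed(naked_value, dimension, clothing):
    """Add the quoted mean clothing increase to a naked measurement."""
    return naked_value + CLOTHING_ALLOWANCE_IN[dimension][clothing]


if __name__ == "__main__":
    # Mean male elbow width (17.25 in., naked) in indoor clothes.
    print(round(clothed(17.25, "elbow width", "indoor"), 2))
    # Mean male underside of thigh-floor (16.50 in., naked) with shoes.
    print(round(16.50 + MENS_SHOE_ALLOWANCE_IN, 2))
```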


The work on which this paper is based is part of the research 
programme of the Furniture Development Council, and is reported 
here by permission of the Head of the Research Department. It has been 
fully described in FDC Research Report No. 8, Anthropometric Data for 
Chair Designers, by H. D. Darcus, M. J. Merrick and D. M. Barkla. Thanks 
are due to the Dunlop Rubber Company for financial support for these investi- 
gations, and to the British Boot, Shoe and Allied Trades Research Association 
for information about shoes. 


Reports of most studies of human body measurements are not easy to obtain, and the
information they contain on dimensions relevant to seat design is generally scattered
among other material.

This paper brings together the main published information on these dimensions.

Most of the populations studied are not in themselves of interest to British designers,
but estimates of the measurements of young British adults are derived from the
published data.


REFERENCES 


Akerblom, B., 1948, Standing and Sitting Posture (Stockholm : A. B. Nordiska Bokhandeln).

Clements, E. M. B., and Pickett, K. G., 1957, Stature and weight of men from England and
Wales in 1941. Brit. J. prev. soc. Med., 11, 51-60.

Darcus, H. D., and Weddell, A. G. M., 1947, Some anatomical and physiological principles
concerned in the design of seats for naval war-weapons. Brit. med. Bull., 5, 31-37.

Fry, E. I., and Churchill, E., 1956, Bodily dimensions of the older pilot. W.A.D.C. Technical
Report 56-459. Wright-Patterson Air Force Base, Ohio.

Geoghegan, B., Roberts, D. F., and Wilson, J. O. C., 1957, Miscellaneous functional
measurements of naval personnel. Royal Naval Personnel Research Committee Report RNP 57/895.
This document also includes Roberts (1957).

Hertzberg, H. T. E., Daniels, G. S., and Churchill, E., 1954, Anthropometry of flying
personnel - 1950. W.A.D.C. Technical Report 52-321. Wright-Patterson Air Force Base,
Ohio.

Hill, A. Bradford, and Morant, G. M., 1945, Report on body measurements of 2200 tank
personnel. Military Personnel Research Committee Report BPC 45/406. M.R.C., London.

Hooton, E. A., 1945, A Survey in Seating (Gardner, Mass. : Heywood-Wakefield Co.).

Kemsley, W. F. F., 1957, Women’s Measurements and Sizes (London : H.M.S.O.) (Published by
the Board of Trade, for Joint Clothing Council Ltd.).

McFarland, R. A., Damon, A., Stoudt, H. W., Moseley, A. L., Dunlap, J. W., and Hall,
W. A., 1953, Human Body Size and Capabilities in the Design and Operation of Vehicular
Equipment (Boston, Mass. : Harvard School of Public Health).

Morant, G. M., 1943, Preliminary report on body measurements of 2400 candidates for air crew.
Flying Personnel Research Committee Report 538. Royal Aircraft Establishment,
Farnborough, Hants ; 1947, Dimensional requirements for seats in R.A.F. aircraft.
Flying Personnel Research Committee Report 682. R.A.F. Institute of Aviation Medicine,
Farnborough, Hants.

Morant, G. M., and Gilson, J. C., 1945, A report on a survey of body and clothing
measurements of R.A.F. personnel. Flying Personnel Research Committee Report 633 (a). Royal
Aircraft Establishment, Farnborough, Hants.




Newman, R. W., and White, R. M., 1951, Reference anthropometry of Army men. Environmental
Protection Section Report 180. Quartermaster Climatic Research Laboratory,
Lawrence, Mass.

O’Brien, R., and Shelton, W. C., 1941, Women’s Measurements for Garment and Pattern
Construction. Misc. Publ. 454. U.S. Dept. of Agriculture, Washington.

Randall, F. E., 1947, Survey of body size of Army personnel, male and female : Methodology
and general considerations (female). Environmental Protection Section Report 128.
Quartermaster Climatic Research Laboratory, Lawrence, Mass.

Randall, F. E., and Baer, M. J., 1951, Survey of body size of Army personnel, male and female :
Methodology. Environmental Protection Branch Report 122 (revised). Quartermaster
Climatic Research Laboratory, Lawrence, Mass.

Randall, F. E., Damon, A., Benton, R. S., and Patt, D. I., 1946, Human body size in military
aircraft and personal equipment. Army Air Forces Technical Report 5501, TSHAA-695-43.
Air Materiel Command, Dayton, Ohio.

Randall, F. E., and Munro, E. H., 1949, Reference anthropometry of Army women.
Environmental Protection Section Report 149. Quartermaster Climatic Research Laboratory,
Lawrence, Mass.

Roberts, D. F., 1957, Descriptive anatomy of naval personnel. See Geoghegan et al. (1957).

Roberts, L. B., 1945, Size increase of men wearing various clothing combinations. Project No. 9.
U.S. Army Medical Research Laboratory, Fort Knox, Kentucky.


NECK-MUSCLE TENSION AND THE POSTURAL IMAGE* 


By F. P. Jones, F. E. Gray, J. A. Hanson and J. D. Shoop

Institute for Psychological Research, Tufts University,
Medford, Massachusetts


The ‘postural images’ of comfort, of correctness, and of height were studied
in seven male subjects aged 16 to 21. The image was recorded photographically
and electromyographically as the immediate response to a specific verbal stimulus.
The response was quantified by measuring the angular relation of head to trunk
and the change in electric potential of the sternocleidomastoid and upper trapezius
muscles. Both sitting and standing postures were recorded. In addition, the
effect on posture of effort (lifting), anticipation of movement, forced respiration,
and standing on tip-toe was studied. The data were treated statistically.


§ 1. INTRODUCTION 


IN a previous paper Jones and O’Connell (1958) described a method for ex- 
pressing dynamic posture in terms of the changing relation between head and 
trunk during movement. If this relation is used as the basic parameter, 
posture can be recorded as a patterned response to a specific verbal stimulus. 
The method provides quantifiable data—linear and angular measures—which 
can be treated statistically. We have used the method in combination with 
the electromyography of neck muscles to study a person’s image of his ‘ most 
comfortable ’ posture, his ‘ best’ posture, and his ‘ greatest height ’. 

The idea of a ‘best’ posture seems to be held almost universally in our
culture. Everyone we have asked claimed that he knew what good posture
was, though he frequently apologized because he himself didn’t have it. In
defining best posture, individuals disagree as to what is and what is not an
example of it. When asked, however, to sit or stand in his best posture,
everyone whom we have observed has made some kind of response, modelled
apparently after an image, consciously or unconsciously held. This image is
usually contrasted with ‘most comfortable’ on the one hand, and with
‘greatest height’ on the other. Occasionally, a subject insists that the three
images are indistinguishable: that when he is in his best posture he is at his
greatest height, and that this is also his most comfortable posture. To most
subjects, however, at least two of the three are distinguished from each other.

The aim of the present study is to define these three images more objectively
by measuring some of the changes that take place during the postural response.
Of the many variables that might be used as measures, we selected the relation
of head to trunk. This relation is central in the organization of posture in
animals (Fulton 1949, Chapter 8) and we believe that it plays a role of similar
importance in human posture. We obtained simultaneous records of the
relative positions of the head and trunk and of the electric potential in two
muscles of the neck, the upper trapezius and the sternomastoid.
The sternomastoids are the two oblique muscles on either side of the neck.
The two parts of the upper trapezius form a triangle running from an apex at
the occiput to a base line drawn through the seventh cervical vertebra.


* This study was supported by a PHS research grant from the Division of General Medical
Sciences of the United States Public Health Service.
According to Gray (1948) the two sternomastoids “acting together flex the
vertebral column, bringing the head forward” (p. 371), while the two sides of
the upper trapezius “draw the head directly backward” (p. 423).


Fig. 1. Subject with markers in place and triangle diagram constructed. Broken lines indicate
the horizontal.


role played by the neck muscles in postural activity and to their importance as
indicators of the general level of muscular tonus. In this study we confined
our attention to postural activity alone, recording from the two neck muscles
and from one back muscle, the erector spinae.
§ 2. METHOD


The subjects were seven male students ranging in age from 16 to 21. All
of them were employed for the summer by Tufts University, six of them in
maintenance work.


2.1. General Procedure 


The subject was seated on a flat stool in an electrically shielded room.
The relation of the head to the trunk was marked by two light plastic rods
attached over the seventh cervical vertebra and the sternal notch, and by a
small cross centred in the Frankfort Plane, midway between the tragion of
the ear and the lowest point of the orbit (Fig. 1). Pairs of Grass electrodes
two inches apart were attached to the skin with Duco cement. One pair was
placed over the right sternomastoid half-way between the insertion and the
sternal head. A second pair was placed over the trapezius half-an-inch to the
right of the central ligament and half-way between the occiput and the seventh
cervical vertebra. A third pair was placed over the right erector spinae two
inches below the lowest point of the trapezius. In addition, there was an
earth electrode over the right scapula.

Muscle potentials were amplified by an Offner Type A Electroencephalograph
placed outside the shielded room. Two integrators were inserted
between the preamplifiers and the power amplifiers. Their output was fed
into two channels of the Offner. Direct muscle activity and integrator output
were recorded by an inkwriter on paper moving at 2.5 cm/sec. The integrator
scores appeared on the record as a series of condenser discharge ‘pips’
proportional in rate to the amplitude of the signal. Photographs of the subject
were taken by a Robot Star camera with a self-advancing film. The shutter
was controlled by a solenoid operated by the experimenter from outside the
booth.

The various procedures were explained to the subject in advance. He
was asked, however, not to rehearse the movement but to wait for the signal
and then make whatever seemed to him the appropriate response. The picture
was taken as soon as the response was completed. The inkwriter was started
before the signal and allowed to continue for approximately ten seconds after
the picture had been taken. An auxiliary pen recorded the opening of the
shutter and the signals given by the experimenter.

For six of the subjects, the series of procedures was repeated in order to
obtain an integrated record of the third muscle. Consequently, 13 pictures
were taken for each procedure. For statistical purposes, data from Subject 7
were tabulated twice, giving 14 observations (n = 14) for the postural patterns
and for the sternomastoid muscle. Since only one integrated record was
obtained from the upper trapezius, for this muscle n = 7.


2.2. Specific Procedures

2.2.1. Sitting

(a) Postural images. The subject, with left profile to the camera, was
seated in whatever posture he chose to assume. The instructions were then
given in the following order: “Sit in your best posture”; “Sit in your most
comfortable posture”; “Sit at your greatest sitting height”.
(b) Weight lifting. To test the effect of effort on the three postures, the
following procedures were used. The subject, seated as before, let his right
hand hang by his side. A standard 2 kg weight was placed beside him. The
weight was attached to an iron rod and swung freely from a wooden handle
which the subject was able to grasp easily without changing his position.
From each of the three postures, the subject on signal lifted the weight by
bringing his arm forward to the horizontal. The rate of lifting was regulated
by a metronome. The three procedures were then repeated with a weight of
4 kg. For each procedure a picture was taken when the arm reached the
horizontal.

(c) Ready to stand. Four procedures were used to test the effect of
anticipatory ‘set’. 1. The subject was instructed to move on signal from
sitting to standing. In preparation for the movement he was asked to sit in
his most comfortable posture. A photograph was taken just before the
signal to get up. Both the preliminary posture and the movement were
recorded electromyographically. 2. The procedure was the same as in 1,
except that the subject was asked to get up more slowly than he was
accustomed. 3. The same as 1, except that no instructions were given for the
preliminary posture and the subject was asked to get up as quickly as possible
after hearing the signal. In the protocols this was referred to as the ‘reaction-time
procedure’. 4. The same as 1, except that in preparation for the movement
the subject was asked to sit at his greatest sitting height.


2.2.2. Standing 

The three postural images of ‘ best’, ‘most comfortable’, and ‘ greatest 
height ’ were recorded for the standing posture in the same sequence as for 
sitting. In addition, the subject was asked (a) to take a deep breath and 
hold it ; (b) to breathe out deeply and hold it ; (c) to stand on tip-toes. 


2.2.3. Subjective reports 

At the end of the session, subjective reports were obtained by means of a 
short questionnaire in which the subject was asked for his definition of ‘ best 
posture ’, whether he distinguished it from the other two, and whether he had 
had any special training in posture. 


§ 3. RESULTS


In answering the questionnaire, none of the subjects said he had had any
special training in posture. Most of them were vague as to what they meant
by ‘best’, and there was no agreement as to which of the three postures was
most efficient. By conventional standards their postures ranged from
excellent to poor. In view of this disparity, their responses during the
experiment showed remarkable regularity.


3.1. Postural Records 


To analyse the photographs, the negatives were projected and a triangle
diagram constructed by connecting the two points on the trunk with each
other and with the centre of the Frankfort Plane (Fig. 1). The plane itself
was then drawn in to form a bar at the apex of the triangle. Coordinates for
the three points were determined, and the angles made with the horizontal by
the cross bar (¢) and one side of the triangle (8) were measured. Means were
obtained for all of these measures. The final data were then replotted to give
a diagrammatic expression of the mean postures (Figs. 3 and 4). Means and
standard deviations for the two angles (which measure the head’s rotation
and its relation to the trunk) are given in Table 1. t-tests were made of the
differences between the means in comparable procedures. Values of t and
significance levels are given in Table 2.
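
The angle a line makes with the horizontal can be computed from the coordinates of its two end points. The Python sketch below shows one way to do this; the function name, the coordinate convention (y increasing upwards) and the example coordinates are illustrative assumptions, and the paper does not say which side of the triangle was used or how the angles were actually read off.

```python
import math


def angle_to_horizontal(p, q):
    """Angle in degrees between the line from p to q and the horizontal.

    Points are (x, y) tuples with y increasing upwards; a positive result
    means that q lies above p.
    """
    dx = q[0] - p[0]
    dy = q[1] - p[1]
    return math.degrees(math.atan2(dy, dx))


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only.
    trunk_marker = (0.0, 0.0)   # e.g. the marker over the seventh cervical vertebra
    head_marker = (3.0, 4.0)    # e.g. the centre of the Frankfort Plane cross
    print(round(angle_to_horizontal(trunk_marker, head_marker), 2))
```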


Table 1. Means and Standard Deviations.


                                      Angle (degrees)                       Muscle (µV)
                                   €                7          Sternomastoid    Upper Trapezius
Procedure                      Mean     SD     Mean     SD     Mean     SD      Mean     SD
 1. Sitting         MC        -13.64  16.96   35.72  14.99     7.16   2.38     15.05  10.77
 2. Sitting         B          -1.42  10.57   50.48   9.17    10.34   6.55      9.44   7.55
 3. Sitting         GH          9.14  10.98   56.94   7.41    25.88  18.02      9.62   5.50
 4. Lifting 2 kg    MC        -12.84  17.00   40.12  15.77    26.23   6.05     70.40  29.29
 5. Lifting 2 kg    B          -1.67  10.33   51.83   9.18    31.32  11.09     51.24  20.91
 6. Lifting 2 kg    GH          8.55  10.46   61.11   8.05    39.26  12.31     56.91  30.03
 7. Lifting 4 kg    MC        -10.64  21.59   41.25  19.82    46.23   9.88    122.97  34.88
 8. Lifting 4 kg    B           4.19   9.80   58.35   8.33    45.06   9.46     95.85  38.99
 9. Lifting 4 kg    GH         10.95  10.94   63.77   9.59    57.74  15.35     99.73  44.40
10. Ready to stand  MC slow   -24.35  22.68   25.11  18.96     7.35   4.83     21.35  14.29
11. Ready to stand  MC norm.  -23.44  24.41   24.56  21.04    10.15   5.22     21.55  14.17
12. Ready to stand  RT        -23.03  20.29   23.78  16.64     7.04   2.38     22.98  11.57
13. Ready to stand  GH norm.    7.08   9.73   55.61   5.93    20.51  18.48      6.27   4.26
14. Standing        MC        -14.71  21.65   41.01  16.49     7.69   4.22     13.10   8.76
15. Standing        B          -3.16  16.51   51.06  13.07    15.39  11.19     12.98  11.69
16. Standing        GH          5.95  11.327  57.81   6.41    30.56  22.74     12.20   6.59
17. Standing        TT         -2.56   7.82   47.29  11.55    16.03   8.87      7.81   3.86
18. Standing        Inh.       -4.88   7.69   49.86   4.44    25.41  15.26     12.65   7.73
19. Standing        Exh.      -17.64  14.17   41.31   8.41     9.70   6.72      5.11   8.85


3.2. EMG Records

A continuous sample of the EMG record, taken while the subject moved
from his most comfortable sitting posture to his greatest sitting height, is
shown in Fig. 2. In the top line, the first mark indicates the signal to the
subject to move; the second mark indicates the signal from the camera
shutter as the picture was taken. The second line carries the direct record
from the right erector spinae. The third and fourth lines carry the raw
muscle and the integrator count from the right upper trapezius, the sixth and
seventh from the right sternomastoid. Calibration records for 30 and 100 μV
are shown on the right. It should be noted that the two integrators follow
different scales.

Because the signal from the heart was so large in the lead from the erector
spinae (amounting to 30 μV or more) we did not use the integrator score from
this muscle in our calculations. (It might be noted that the behaviour of the
erector spinae did not distinguish the three postural images as did that of the
two neck muscles.) In the neck leads, the heart played a much smaller role,
ranging, according to our best estimate, from 63 μV down to almost nothing.

Mean muscle activity was calculated by counting the number of pips in
the integrator record over a 5 sec period and converting this number, by
means of calibrating signals, to the average equivalent in peak-to-peak μV/sec.
The final figure was obtained by subtracting the estimated ECG from the μV
total.

The behaviour of the two neck muscles in the various procedures is shown
diagrammatically in Figs. 3 and 4. Means and standard deviations for both
muscles are given in Table 1; values of t and significance levels
are given in Table 2.
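
The conversion from integrator pips to microvolts can be sketched as follows. This is only an illustration under stated assumptions: the integrator is taken to respond linearly between the two calibration signals, and the pip counts, the ECG estimate and all function names are hypothetical rather than taken from the records.

```python
def pips_to_microvolts(pips, cal_low, cal_high):
    """Convert an integrator pip count to microvolts by linear interpolation.

    cal_low and cal_high are (pip_count, microvolts) pairs read from the
    calibration records; a linear integrator response is assumed.
    """
    (n0, v0), (n1, v1) = cal_low, cal_high
    slope = (v1 - v0) / (n1 - n0)
    return v0 + slope * (pips - n0)

def corrected_activity(pips, cal_low, cal_high, ecg_microvolts):
    """Mean muscle activity after subtracting the estimated ECG contribution."""
    return pips_to_microvolts(pips, cal_low, cal_high) - ecg_microvolts

# Hypothetical calibration: 6 pips per 5 sec at 30 uV, 20 pips at 100 uV.
cal_low, cal_high = (6, 30.0), (20, 100.0)
print(corrected_activity(13, cal_low, cal_high, ecg_microvolts=5.0))  # 60.0
```

Because the two integrators follow different scales, each muscle would need its own pair of calibration points.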
Fig. 2. Sample electromyogram. Movement from ‘most comfortable sitting posture’ to
‘greatest sitting height’.

Table 2. Differences between Means: t values and significance levels.

                        Angle                             Muscle
                  ε                θ           Sternomastoid    Upper Trapezius
Procedure      t  sig.          t  sig.          t  sig.          t  sig.
1 vs 2      3.09  p 0.01     4.19  p 0.01     2.42  p 0.05     3.95  p 0.05
2 vs 3      4.61  p 0.001    3.92  p 0.01     3.77  p 0.01
1 vs 3      6.84  p 0.001    6.55  p 0.001    4.03  p 0.01     1.72
1 vs 10     2.39  p 0.05     2.95  p 0.05                      2.66  p 0.05
1 vs 11     1.87             2.54  p 0.05     2.37  p 0.05     2.34
1 vs 12     1.75             2.51  p 0.05                      2.22
4 vs 5      2.14             2.28  p 0.05     1.71             3.92
5 vs 6      5.01  p 0.001    4.61  p 0.001    2.84  p 0.05
4 vs 6      4.38  p 0.001    4.77  p 0.001    3.73  p 0.01
7 vs 8      2.62  p 0.05     3.12  p 0.01     0.32             1.53
8 vs 9      4.42  p 0.001    3.54  p 0.01     3.60  p 0.01
7 vs 9      4.55  p 0.001    4.66  p 0.001    3.03  p 0.05
14 vs 15    1.52             1.65             3.65  p 0.01
15 vs 16    3.50  p 0.01     2.72  p 0.05     3.76  p 0.01
14 vs 16    3.30  p 0.01     3.49  p 0.01     4.15  p 0.01
18 vs 19    3.33  p 0.01     4.03  p 0.01     4.68  p 0.001


For ε, θ and the Sternomastoid, n=14; for the Upper Trapezius, n=7.
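
As an illustration of how a t value of the kind tabulated above might be obtained, the following Python sketch computes a t statistic for the mean of paired differences. The paper does not state which form of the t-test was used, so the paired form is an assumption, and the readings below are invented for the example.

```python
import math
import statistics

def paired_t(first, second):
    """Return (t, degrees of freedom) for the mean of paired differences."""
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD, n - 1 in the denominator
    return mean_diff / (sd_diff / math.sqrt(n)), n - 1

# Hypothetical readings for seven subjects in two procedures.
most_comfortable = [30.0, 42.0, 18.0, 55.0, 36.0, 25.0, 44.0]
best = [48.0, 51.0, 40.0, 58.0, 52.0, 47.0, 50.0]
t, df = paired_t(most_comfortable, best)
print(f"t = {t:.2f}, df = {df}")
```

With seven pairs there are six degrees of freedom, for which a two-tailed t of about 2.45 is needed to reach the 0.05 level.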
§ 4. DISCUSSION

The triangle diagrams and the bar graphs provide succinct definitions for
each of the 19 procedures. A glance will show how relationships among the
indices (four postural and two muscular) change from one response to
another.


Fig. 3. Mean sitting postures (procedures 1 to 13).
MC=most comfortable; B=best; GH=greatest height; RT=reaction time.


4.1. The Nine Sitting Procedures 


The three postural images are sharply defined by the diagrams. In terms
of the angles, ε and θ, each differs significantly from the other two at the
0.01 level or better. ‘Greatest height’ is the most stereotyped of the three
responses. The chest is lifted and the head rotated backward at much the
same angle in all of the subjects. ‘Best’, which falls between the two extremes,
shows more individual differences. The greatest variation, however, is found
in ‘most comfortable’, which ranges from almost completely erect (θ=55°)
to almost completely collapsed (θ=1.5°).

In the two lifting procedures, the three images are differentiated as before, 
with the greatest variation again in the most comfortable. Doubling the load 
makes little difference, except for a slight increase in ε which appears in each
of the three postures. 

A third measure which sharply differentiates the three postural responses 
is the behaviour of the sternomastoid muscle, whose potential increases as the 
subject moves from most comfortable to best and from best to greatest sitting 
height. The ability of this muscle to differentiate posture is retained in both 
of the lifting procedures, though its overall activity has increased greatly as a 
result of the lifting itself. An analysis of variance for the nine procedures 
showed that the postural differences were significant at the 0.01 level.

The postural activity of the upper trapezius followed a different (and often 
an opposite) course from that of the sternomastoid. It failed to distinguish 
the two erect postures from each other, but when the subjects moved from 
best to most comfortable, activity in the upper trapezius increased significantly. 
In this its behaviour is reciprocal to that of the sternomastoid. The reciprocal 
relation between the two muscles persists in the two lifting procedures, though 
in the 4 kg lift the difference in trapezius activity is not statistically significant. 
What the upper trapezius seems to be registering here is the postural change— 
the forward thrust of the head—that accompanies the subject’s attempt to 
relax in assuming a more comfortable posture. To test this possibility we 
asked one of the subjects to relax still further. When he complied there was 
a further increase in trapezius activity. 


4.2. The Four ‘Ready’ Procedures

The effect of anticipation on the postural pattern can be seen most clearly
in the reaction-time procedure (procedure 12). In getting ready to stand,
the subjects regularly moved forward and down, increasing the forward thrust
of the head. Though less marked, the pattern can still be seen in procedures
10 and 11, for which the subject was asked as a preliminary to sit in his most
comfortable posture. Angle θ in both of these procedures differs significantly
from the same angle in procedure 1, where there was no anticipation of
movement. Only in procedure 13, where the subject was asked as a preliminary
to sit at his greatest height, was this element of anticipation controlled. The
pattern of 13 does not differ significantly from that of 3 and the two angles
are virtually the same in both procedures.
It might be expected that anticipation of movement would produce a
general increase in neck-muscle tension, but the expectation was not confirmed
by our data. In four of our subjects, it is true, the sternomastoid showed
considerable increase before the normal movement. But in the other three
there was no increase at all, and before the slow movement and the
reaction-time procedure (where an increase might well be expected) the mean
actually decreased. The upper trapezius, which showed a slight but consistent
increase, was probably reflecting the changes in posture only rather than
anticipation of movement. It is significant that when the postural change
was controlled there was no increase in either muscle.


4.3. The Six Standing Postures 

The three postural patterns (most comfortable, best, and greatest height)
are essentially the same for standing as for sitting. The most sensitive index
is the sternomastoid muscle, which distinguishes all three of the responses at
the 0.01 level. For the upper trapezius, on the other hand, the means were
virtually the same for the three procedures.

Standing on tip-toes produced a pattern almost identical with that of best
standing posture. The response was more uniform, however, the standard
deviation being smaller in all four indices.


Fig. 4. Mean standing postures (procedures 14 to 19).
TT=tip-toes; Exh.=exhale; Inh.=inhale.


In the two breathing postures, the sternomastoid provides the best criterion
(significant at the 0.001 level). Potential in this muscle increases in inspiration,
decreases in expiration. The upper trapezius shows no significant change in
either procedure. The sternomastoid is one of the accessory muscles of
respiration (Gray 1948, p. 391); it helps to lift the rib cage in forced inspiration.
Shortening to do so, it appears to tilt the head back as well, since there is an
increase in ε that is significant at the 0.01 level. (Superficially, the patterns
of inspiration and expiration resemble ‘best’ and ‘most comfortable’
respectively. The standard deviations for the two angles are much smaller
in the breathing postures, however.)


§ 5. CONCLUSION 
In this study we have not attempted to define good and poor posture or
to set up any positive standards on which such definitions might be based.
Our aim was to find out something about postural behaviour and estimate
the extent to which this behaviour can be described in terms of
stimulus-and-response. Though we were obliged to work with a small number of subjects,
we believe that they were representative of their age and educational back- 
ground. Within this group there was a wide range of postures, judged by 
conventional standards, but postural responses showed a high degree of 
regularity. The patterns, which seem to correspond to subjective images of 
comfort, correctness, and height, can conveniently be defined in terms of the 
relative position and angle of the head and the differential activity of two 
muscles in the neck. 


The ‘postural image’ of what constitutes a comfortable position, a correct
position, and a position of greatest height was studied. The subjects were seven
young men between sixteen and twenty-one years of age. This image was
represented as an immediate response to a specific verbal stimulus. The response
was recorded simultaneously by photography and electromyography, and it was
measured by the angular relations between the head and the trunk and by the
electrical changes in the sternocleidomastoid muscle and in the upper portion of
the trapezius. Responses were recorded in the sitting position and in the
standing position. In addition, the effects of the effort of lifting, of
anticipated movements, of forced breathing, and of standing on tip-toes were
studied.

The results of the experiment were treated statistically.


The inner postural images for comfort, correctness and greatest stature were
investigated in seven male persons aged 16 to 21 years. In each case the image
was recorded photographically and electromyographically at the same time, as an
immediate reaction to a specific stimulus word. The reaction could be evaluated
quantitatively by measuring the angle between head and trunk and from the change
in electrical potential in the upper trapezius and sternocleidomastoid muscles.
The images were recorded in sitting as well as in standing. In addition, the
effect on posture of exertion (lifting), preparation for movement, deep breathing
and standing on tip-toes was investigated. The data were subjected to statistical
treatment.
FATIGUE AND DRIVING


By A. CRAWFORD


Road Research Laboratory, Department of Scientific and Industrial Research, 
Harmondsworth, Middlesex, England 


The problem of fatigue in driving is two-fold. It includes both the fatigue 
resulting from driving and the effects of fatigue, from whatever source, on driving. 
The study of fatigue in drivers is hindered by the fact that no reliable method of 
measuring fatigue or defining driving performance has yet been found. This 
paper reviews some of the research on operational fatigue, and discusses the source 
of fatigue in driving. It is concluded that stress is an important factor in the 
production of driving fatigue, and various methods of assessing the effects of 
the emotional arousal it produces are discussed. 


§ 1. INTRODUCTION 


WRITERS on the subject of fatigue have almost always found difficulty in 
arriving at a definition. The problem arises partly from the breadth and 
variety of the word’s meaning in common usage and partly from the relatively 
small amount of direct research on the subject. Many of the effects of fatigue 
have been thought to account for the failure of experiments on other matters. 
Although some writers (Muscio 1921, Browne 1953) have questioned the 
acceptability of ‘fatigue’ as a subject for study, it is agreed without exception
that the effects are in themselves important and worthy of study. Floyd 
and Welford (1953) estimate that the cost of fatigue allowances in industry 
amounts to tens of millions of pounds per year. The contribution of fatigue 
to accidents is not known, but the nature of its effects leaves little doubt that 
the cost to the nation from this source must also be very high. 

Many writers, including Bartlett (1953) and Broadbent (1953) define fatigue 
as deterioration in an activity as a direct result of being engaged in it. This 
definition is too narrow for the present purposes because it does not include 
the possibility of drivers being fatigued from other causes when they start 
to drive. That such a ‘ generalized ’ fatigue is possible has been demonstrated 
in experiments which seek to find objective tests for fatigue such as an American 
study (Jones et al. 1947) of the effect of different periods of driving. In this 
experiment long-distance truck drivers were stopped and tested as they 
approached the terminus of a trip; the test results showed a general decline 
in several measures of psychomotor performance after long periods of driving. 
There are, therefore, two interrelated aspects of the practical problems of 
driving and fatigue, namely the fatigue arising from driving (operational
fatigue) and the effect of fatigue, from whatever source, on driving. Whilst
it may be possible to reduce the former, simply by cutting down the number 
of hours of driving, the effect of the latter requires study. For this, however, 
an objective measure of fatigue is necessary. 


§ 2. OPERATIONAL FATIGUE 


2.1. Driving


It is generally agreed that performance can be impaired by driving for 
too long a period but it has proved extremely difficult:

(i) to define what is meant by driving performance,

(ii) to develop adequate techniques of measuring it,

(iii) to interpret the significance for fatigue of any signs of deterioration
in performance,

(iv) to define the amount of deterioration which might reduce safety.

Measurement of discrete or isolated reactions throws little light on the
deterioration of a complex skill. In order to examine changes in the patterns
of a skill, techniques must be employed which, so far as driving is concerned,
are at present only in the process of development. However, given these
techniques, the most difficult problem still remains, that of administering the 
tests under conditions of tension and motivation comparable with those under 
which the skill is usually performed. Although research on ‘ fatigue in general ’ 
has been exploratory and inconclusive, Welford (1953) can at least say of it 
that “the way has been cleared for a definite attack on the problem ” and 
that we have reached the “end of the beginning”. Unfortunately this is 
not the case with driving fatigue where, since a basic knowledge both of 
fatigue and of driving performance is lacking, the beginning has not yet been 
reached. Nevertheless, a number of investigations have shown interesting 
results. 

Reference has already been made to a study of the relationship between 
the number of hours of driving and certain psychological and physiological 
measures (Jones et al. 1947). This investigation covered 889 interstate truck
drivers from three cities who were stopped just before reaching their destina- 
tion, taken from their vehicles and tested. The results were compared with 
those for a control group, matched as to hours of sleep, who had done no 
driving. Of the measures used, hand steadiness, body sway, speed of tapping, 
and what the authors call ‘ reaction co-ordination time’ showed the greatest 
differences. There was a more or less consistent decrease in psychomotor 
performance with increase in the number of hours driving, but there was no 
sharp break in the results which might have indicated a particular moment of 
onset of fatigue. The value of the research was limited by the fact that the 
validity of the tests as predictors of poor driving or accidents was not known. 
However, the authors conclude that long hours of driving as well as the off-duty 
activities of the operators reduce ‘efficiency’ and therefore the safety of 
driving. A similar but more intensive experiment reported by Ryan and 
Warner (1936) on six drivers tested after one hour and ten hours of driving 
also showed a demonstrable effect on steadiness, co-ordination reaction time 
and some other measures. The value of this experiment was also limited by 
the use of non-validated measures. Lauer and Suhr (1958), in attempting to 
validate laboratory experiments on driving by comparing their results with 
those of similar tests made during actual driving, showed that frequent rest 
pauses do much to reduce deterioration of performance due to prolonged 
driving. 

In another type of investigation (McFarland and Moseley 1954), long hours 
of driving were imposed to bring about the fatigue condition, and the effect 
on driving was then noted. In this study the activities of two bus drivers on 
regularly scheduled passenger runs were directly observed and classified. 
Comparison of performance during the first and last half hours of the drive
(the duration of the drive is not stated but may have been 3½ hours) showed
considerable reduction in the number of steering wheel movements in the
last half hour (steering movements were made about 10 times as often as all
other recorded movements together). A breakdown of the data into the
number and kind of activities recorded for the first five minutes of each of
seven successive half hours showed no significant trends in any of the measures.

A further observational type of experiment, and one which appears to
provide a very promising line of research, is the study of near accidents. An
experiment of this type is that reported by Potts (1951) and by McFarland and
Moseley (1954), in which seventeen long haul truck drivers on a total of 20 trips
averaging 250 miles per trip, were accompanied by an observer. In all,
48 near accidents were reported by the observer, 46 per cent of them occurring
within the first 2 hours and 67 per cent during the first 4 hours: in 72 per cent
the trucks were travelling at 35 m.p.h. or faster. The numbers reported in
each hour of the trip are shown in the following table:


Hour of trip               1   2   3   4   5   6   7   8   9

Number of near accidents  11  11   6   4   4   4   4   3   1


These results, if they are valid and reliable, indicate that an increase in the
driving period, far from being detrimental, leads to a reduction in near accidents.
This startling finding is partly supported by a number of American studies
on long haul truck accidents. McFarland and Moseley (1954) report that the
accident prevention departments of two large insurance companies found that
approximately 60 per cent of all long haul truck accidents in 1949 occurred
during the first 3½ hours of driving; they also found that ‘driver asleep’
accident statistics indicated that the most dangerous time is 4 to 5 hours after
the last sleep, and that the peak of the ‘driver responsibility’ accidents to
interstate commercial drivers in 1938 occurred after 5 to 6 hours on duty.
Unfortunately, all these data are open to the suspicion that they are reflecting
the period of exposure of drivers to risk of accidents, and that before reliable
assessments can be made it is necessary to know the distribution of the numbers
of hours driven. These authors suggest that, as well as lack of rest, anxiety
and other emotional states due to off-duty interpersonal relationships
contribute to the pattern of accident behaviour. In other words, it appears that
transferred fatigue has at least as important a part in accidents as operational
fatigue.
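
The percentages quoted for the near-accident table, and the exposure problem raised above, can be illustrated with a short Python sketch. The hourly counts are those of the table; the exposure figures (number of trips still in progress in each hour) are invented solely to show the normalisation step and are not data from the studies cited.

```python
near_accidents = [11, 11, 6, 4, 4, 4, 4, 3, 1]  # hours 1 to 9, from the table
total = sum(near_accidents)

def cumulative_share(counts, hours):
    """Percentage of all events occurring within the first `hours` hours."""
    return 100.0 * sum(counts[:hours]) / sum(counts)

print(total)                                       # 48
print(round(cumulative_share(near_accidents, 2)))  # 46
print(round(cumulative_share(near_accidents, 4)))  # 67

# Hypothetical exposure: trips still in progress during each hour.
trips_in_progress = [20, 20, 18, 15, 12, 10, 8, 5, 2]
rate_per_trip_hour = [a / e for a, e in zip(near_accidents, trips_in_progress)]
for hour, rate in enumerate(rate_per_trip_hour, start=1):
    print(hour, round(rate, 2))
```

Under these invented exposure figures the rate per trip-hour no longer falls steadily with time, which is the sense in which raw counts alone cannot settle the question.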

Three ‘hypnotic’ effects of long monotonous journeys have been noted
by American investigators (Haber et al. 1954). Most of the evidence for them
is anecdotal but it is of sufficient volume to merit serious consideration.
The effects are:

(i) The driver’s inability to appreciate his speed in terms of stopping
distance when travelling at high speeds or when slowing to enter a
restricted area.

(ii) A state of ‘trance’ brought about by traversing mile after mile of
monotonous highway. For British drivers this is perhaps an experience
to be expected in the future.

(iii) Hypnagogic hallucinations (McFarland and Moseley 1954), in which,
after driving long distances, drivers imagine they see something on
the road, and make emergency stops: the driver carries out the
emergency stop, sometimes driving off the road, without recognizing
that the situation is not real. The phenomenon occurs typically at
night, while the vehicle is moving but while the driver’s activity is
at a low level. The content of an hallucination is most probably
related to the driver’s individual experience, but the frequency of
occurrence seems to be very high in certain groups. All 33 of a group
of long haul truck drivers interviewed reported having experienced
these hallucinations, mainly at night (McFarland and Moseley 1954),
whilst none of a group of 20 local truck drivers had done so. These
drivers thought that the frequency of the hallucinations had been
reduced considerably after the introduction of a shorter working day.

Unfortunately very little is known of the conditions producing these
phenomena and no clear idea of their prevalence seems to have been formed
by the investigators; their importance to the problems of long hours of driving
in Britain is therefore unknown, but it is thought to be less than in the U.S.A.,
where distances driven are much greater.


2.2. Complex Skills 


Since none of the experiments on driving show clearly the effect of fatigue 
as a result of driving, it may be useful to study the results of experiments on 
fatigue in relation to similar tasks. The most relevant work is that on 
operational fatigue and instrument flying, and Drew’s conclusions (1940) are 
classical in this field. 

Drew studied instrument-flight performance with a standard aircraft 
cockpit and simulated controls. Experimental sessions of 2 hours duration 
were divided into seven equal periods of intensive activity interspersed with 
three similar periods when, though complete relaxation was not possible, level 
flight was simulated; for some subjects the tests were extended to 6-7 h and
their results confirmed those of the 2 h tests. Motivation and interest are
said to have remained high throughout each test. There were six primary
flying instruments, and when the subjects were fresh these were followed as an
interconnected whole. Towards the end of the session, however, reactions
to correct the individual misalignments shown on the instruments became
more automatic and conventional, in the sense that the subjects used the
kind of interpretation which first occurred to them regardless of correctness
and became increasingly willing to accept lower standards of accuracy and
performance. The instruments outside the immediate range of attention, at
the edge of the display, were given less attention, and by the end of 2 h 60 per
cent of the subjects paid no attention to them at all. The timing score
improved initially, but, as the session progressed, began to deteriorate rapidly.
Finally, whenever the subjects were informed they were in the last stage of a
run they showed an ‘irresistible tendency to relax when home is in sight’.

Bartlett (1943, 1951, 1952, 1953) has summarized his conclusions from his
own and succeeding experiments (e.g. Davis 1948) under the heading of ‘Skill
Fatigue’. He distinguishes between fatigue resulting from hard physical
work and fatigue from work calling for little muscular reaction but requiring
persistent concentration and a high degree of skill. According to him, skilled
tasks, such as driving, require complex, co-ordinated and accurately timed
activities as well as simple repetitive movements. One essential characteristic
of skill fatigue is that although the subject is physically capable of performing
the desired task, he does not actually carry it out correctly unless he takes
particular care.

With the onset of skill fatigue, the subjects’ ‘timing’ is affected (Bartlett
1953); parts of the cycle of operations are occasionally slowed and other parts
rushed to compensate. At a second stage subjects ignore some of the elements
in the task. It might be said that the field of display is ‘split up’ according
to the proportion of signals ‘seen’ or ‘unseen’ in its various parts: outlying
elements are likely to be either ignored, as Drew demonstrated, or stressed
at the expense of the closely knit, central elements, and in consequence correct
actions may be performed but at the wrong time, while other actions are
omitted altogether. At a third stage aches and pains are noticed; their
localization is unimportant but, in common with many elements of the total
situation previously ignored, they reach consciousness and command response.
Bartlett concludes that fatigue may be seen as a progressive widening of the
field of stimuli to which response is made, so that the subject’s actions lose
direction. Insight deteriorates fairly early and personal standards are lowered.
Actions the subject would normally consider hazardous are undertaken with
little concern when he is over-fatigued (Davis 1948).

It is difficult to estimate how long it will take for each of these stages to be
reached during driving. It seems unlikely that many drivers will advance
far into the second stage without some incident occurring to stop them driving,
or at least to make them alive to the dangers, and change their level of arousal.

Broadbent (1953) argues that a self-paced task is not likely to show the
errors of timing characteristic of the first stage of skill fatigue, but that if the
subject’s working pace is dictated by the task he will be unable to compensate
for momentary lapses and the task will be sensitive to fatigue. The ‘clock
test’ reported by Mackworth (1950) provides a task of the latter type: it
requires the subject to watch a moving pointer and to report its very infrequent
double jumps, with no possibility of compensating for a momentary lapse of
attention in which a signal is missed. Experiments with this test showed a
rather rapid decrease in performance despite the small amount of overt action
required. This type of vigilance task seems to be one in which quick
deterioration is to be expected. In considering the way in which Bartlett’s concept of
skill fatigue may be expected to apply to driving, it is at present a matter for
conjecture how often the pace of driving is sufficiently forced for momentary
lapses to assume importance.

While laboratory experiments of this kind may indicate how the perceptual
skills involved in driving are affected by fatigue, they cannot reproduce the
personal contact between drivers, the competitiveness and the frustrations of
the real-life situation. On the road a driver is trying to remove himself from
the present situation and to reach his destination, probably as quickly as
possible. Such conditions lead to emotional responses which may dominate
his behaviour and affect the rate of deterioration of his performance.


§ 3. THE SOURCE OF FATIGUE


There is a vast difference between the effect of repeated contractions of a
muscle in response to stimuli from electrodes placed directly on it and the
condition of irritability, unusual sensitivity and irrational stubbornness, which
is typically observed in those who have been engaged for a long period in
sedentary work. Both conditions reduce efficiency; both are called fatigue. 
Between them lies a range of changes in the body chemistry and in overt 
behaviour the degrees of which seem to depend on the extent to which the 
central nervous system is involved in the task. That is to say, the more 
integrated the behaviour and, consequently, the more varied the possible 
sources of stress, the more gradually will the changes occur, the less will they 
depend on specifically physical effort, and the harder will they be to detect. 

Except among truck drivers, who may man-handle their loads, gross 
musculature is not used to any substantial extent while driving, and drivers 
are not found to suffer from the symptoms of fatigue produced by severe 
exercise. On the other hand stresses may develop in heavy urban and holiday 
traffic, or in bad weather, or when road surfaces are difficult. In heavy traffic
the driver’s pace of driving is determined by the density and speed of the 
traffic stream and his willingness to conform. With the present road system 
the pace of driving is likely to be most firmly controlled by a traffic stream whose 
speed is usually fairly low, although sometimes rather higher: for example, 
along the first 20 miles of some of the main roads out of London the speed is 
high for the density of the traffic. It seems likely that there will be an increase
in driving stresses from this source on the busier parts of the high speed roads 
of the future. 


3.1. Stress 


Emotional stress can have serious physiological results in subjects carrying 
out tasks in which there is little or no chance to correct errors; and stressful
activities which do not lead to exhaustion—such as motor racing—produce 
severe depletion of energy reserves shown to be associated with deficiencies in 
hormone production of the adrenal cortex (McFarland and Moseley 1954). 
Studies of the galvanic skin response during driving have shown that there is 
considerable emotional response in traffic (Michaels 1960, Hulbert 1957), 
particularly when vehicles are turning, crossing, or merging, or are in close 
proximity to the test vehicle. A mild emotional response is not harmful 
provided that the driver is in reasonably good physical health, but repeated 
emotional arousal over a relatively short interval of time will raise a driver’s 
general emotional level as one response builds on the arousal level of the 
previous one. As a result, an otherwise unexpectedly strong response may be 
evoked by a very slight irritation. Such behaviour is described as ‘ over 
sensitive ’ and the subject as ‘ fatigued ’. 

The second result of a sustained state of emotional arousal in a driver
over a short period is that for some time afterwards it is more difficult than
usual to elicit an emotional response. This may be regarded as another
example of adaptation to stimulation; and in this case it will protect the driver
from the severe depletion of energy reserves mentioned above. The effect
may be described as a raised threshold of arousal. It will depend for its extent
on the previous arousal level, and it will appear subjectively as depression
and loss of interest in the environment, and behaviourally as a reduced
intensity or vigour of response to it.

One of the commonest sources of emotional arousal and stress is interpersonal
relationships, and it is important to assess how much the stress from
such sources contributes to the stress from traffic conditions. Whilst the
stress of unresolved personal conflicts, fear, or anxiety is commonplace, the 
casual fleeting relationships with other drivers in the various traffic situations 
will also be a source of emotion to a varying degree, depending upon the 
driver’s personality and temperament. Arousal from one set of causes can be 
added to that from another, as described above, and the one will always reduce
success in overcoming the other. In extreme cases, when the strain is so
strong as to interfere with sleep, a condition of chronic fatigue may develop. 
Such a condition follows prolonged periods of unpleasant emotion, and one 
of the best cures for it is physical exertion. 


3.2. The Sleep Mechanism, Emotion and Arousal 


In order to understand fatigue, then, it is necessary also to understand 
emotion and arousal. In view of this it will be no surprise that in recent 
assessments emotion, the arousal state, and the sleep mechanism are linked 
(Lindsley 1951, Magoun 1958). They are considered as different aspects of a 
single system having one continuum from deep coma via somnolence, states 
of relaxed wakefulness, bodily tension (aroused reflexly through lower levels 
of the spinal cord and the brain stem to unexpected and persistent external 
stimulation) to extreme excitement and emotional disturbance with loss of 
control and disruption of skilful activity. 

Experiments have sometimes shown that there is very little deterioration 
in performance with loss of sleep. For instance Warren and Clark (1937)
showed that after 65 h without sleep there was little reduction in performance 
level even though the subjects had to struggle to keep awake. One explana- 
tion of this is suggested by Laslett, who had no difficulty in securing volunteers 
for a similar experiment on lack of sleep, but had to prevail on his friends to 
act as subjects in the control series, with normal amounts of sleep (Seashore
1957); he concluded that the motivation of being in a difficult experiment
helps to compensate for any deleterious effects of sleep loss.

Deterioration of performance resulting from lack of sleep is reflected in
serial or continuous tasks like driving in traffic rather than in tasks which call
for single actions and in which there are thus opportunities to rest. Subjects
can arouse themselves momentarily, but if the test calls for continuous performance
there is difficulty in maintaining arousal even for trials lasting a few
minutes (McFarland and Moseley 1954, Seashore 1957). Many of the
behavioural effects of forced wakefulness are of the same type as noted by
Broadbent (1953) for vigilance tasks with rested subjects.

Malmo (1957) has shown that the experimental evidence supports the idea
of an arousal or intensity dimension of behaviour as suggested by Lindsley
(1951) and others, notably Duffy (1951), Hebb (1955) and Schlosberg (1954).
The experiments, which were concerned with the relationship of physiological
measures to the level of motivation (e.g. Stennett 1957, Surwillo 1956), indicated
that the measures may serve to quantify this dimension of arousal.


§ 4. SOME POSSIBLE METHODS OF MEASURING ‘STRESS’ FATIGUE

In the introduction it was suggested that the problem of fatigue would be
clarified if an objective measure of fatigue could be found, and in the last
section it was suggested that a raised threshold of arousal following a period
of stress is an alternative description for the kind of fatigue produced by
driving. This section reviews the possibility of measuring the level of activa- 
tion and the threshold of arousal. 

In view of the fact that driving fatigue is identified with a raised threshold 
of arousal, the arousal value of any experiments or tests must be regarded as a 
parameter and either controlled or allowed for in the experimental design. 
For instance, if it is found that the subject’s interest in the test itself is short- 
lived when he is fatigued but sustained for much longer periods when he is 
fresh, this will give a measure of the fatigue. A test has arousal value princi- 
pally because it focusses the attention of the subject, but there are other 
contributory factors such as the personal satisfaction found in agreeing to 
co-operate, in the competitiveness, and in the self-knowledge gained from the 
results. For these reasons, until basic research on tests for fatigue is under- 
taken, experiments must be conducted as far as possible without the subject’s 
knowledge. 


4.1. Physiological methods 


There can be no doubt that the most satisfactory index of fatigue would 
be a physiological measurement of some description. However, considerable 
practical difficulties in the recording techniques limit their usefulness at 
present. 

The electroencephalogram has been shown by a number of writers (Lindsley 
1951, Crawford 1955, Lindsley et al. 1949, Moruzzi and Magoun 1949) to 
present an activation pattern which may be found to give a measure of effort 
and of a subsequently raised threshold of arousal. Muscle spikes from 
electrodes placed on the pinna of the ear and from saline-moistened pads placed 
just above the eyebrows were found by Kennedy (1953) to vary with periods 
of alertness on a monotonous vigilance task. The change in electrical skin 
resistance (GSR) with sweating has been used for many years as a measure of 
emotional response. This method is regarded by Darrow (1936) as at least 
as good as any other measure, but it has the disadvantage that while changes 
are easy to record reliably, the absolute level of sweat gland activity is very 
hard to estimate. The list of current physiological measures of emotional 
arousal also includes blood pressure and volume, electrocardiogram and heart
rate, respiration, skin temperature, pupillary response, salivary secretion,
pilomotor response, chemical sweating indices, analysis of blood, saliva and
urine, metabolic rate, muscle tension, eye blink, and tremor. Of these, each is
affected to a greater or lesser degree by sympathetic nervous activity but also
by subsidiary reflex factors which cannot be fully controlled experimentally.
At present, therefore, it is not possible to make use of the absolute values of
physiological measures. Perhaps the most sensitive of them could be used
with a standard arousing stimulus, and in this way the threshold of arousal
gauged.

Schnore (1959) has examined the change in a number of these measures 
following stimulation under standard conditions and found not only that there 
were individual variations in arousal threshold but also that the measure 
showing the greatest change differed from subject to subject. However,
Schnore makes the point that, despite the idiosyncratic differences in the
patterns of results, any individual placed in an arousing situation will show an
increase in most functions even though his general level may be different from
the next person’s. Comparisons made with these measures must therefore be
of the amount of change in level of arousal in members of a single group of
subjects: the use of a control group is not applicable.

Apart from the disadvantage of the need for electrodes or their equivalent
connected to the subjects, and for sensitive amplifiers and recording equipment,
it appears that physiological methods in general are more likely to be
useful in indicating the changes in emotional stress during a task rather than
the cumulative effect of it in the form of fatigue. However, there are signs
that the basal level of arousal may be identified, for example in comparing
anxious with hysterical subjects by measuring the sedation threshold (Claridge
1959, Shagass 1954). Practical application of the method to drivers might
perhaps be approximated by giving them a sedative in a cup of tea after a
period of driving and measuring the time before it takes effect.


4.2. Psychological methods 


Mackworth’s clock test, referred to in section 2.2, is an example of the
sort of measure which may prove sensitive to a general level of fatigue. Other
examples are Kay’s (1955) test of immediate serial recall, and the solving of
fairly difficult unfamiliar problems, such as those used by Welford et al. (1950).
In Kay’s test, subjects were required to press buttons in response to the
appearance of light signals, the correct button to be pressed being the one
associated with, say, the third from most recent signal. This test proved
sensitive to fatigue, and there were differential effects with age. In Welford’s
problem-solving experiments the significant finding was that ‘fatigued’
subjects new to the task were less successful than those tested for the first
time when they were fresh. The first group when tested after recuperation
still showed a poorer performance than the second group retested when they
were tired; the differential effect was thought not to be due to differences in
the ability of the groups.
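
The scoring rule in Kay’s test can be made concrete with a minimal sketch. It assumes that ‘third from most recent’ means the signal two positions before the current one, and that responses made before three signals have appeared are not scored. The function name, the `lag` parameter and the example sequences are hypothetical and are not taken from Kay (1955).

```python
def score_serial_recall(signals, responses, lag=3):
    """Score a delayed serial-recall run.

    signals   -- sequence of light identifiers, in order of appearance
    responses -- button pressed after each signal (same length as signals)
    lag       -- 1 means the most recent signal, 3 the third from most recent

    Returns (number correct, number of responses that could be scored).
    """
    if len(signals) != len(responses):
        raise ValueError("one response is expected per signal")
    if lag < 1:
        raise ValueError("lag must be at least 1")

    correct = 0
    scored = 0
    for i, response in enumerate(responses):
        target_index = i - (lag - 1)
        if target_index < 0:
            continue  # too few signals shown so far to define a target
        scored += 1
        if response == signals[target_index]:
            correct += 1
    return correct, scored


# Hypothetical run of five signals; the first two responses are not scored.
signals = [1, 4, 2, 3, 1]
responses = [0, 0, 1, 4, 3]
print(score_serial_recall(signals, responses))  # (2, 3)
```

An error rate derived in this way would be only one possible index; the original work may have scored the test differently.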

Perhaps the most sensitive tests are likely to be those which call, at least
momentarily, for a high rate of arousal, and in which there is no opportunity
for the subject to compensate for errors nor time for him to arouse himself if
he is not ready. Various tests of this kind would be relevant to driving, but
the tasks should be paced and should contain central and outlying elements,
so that lowered standards of performance would show up. Preferably, the
tasks should also be easily administered without the driver having to leave the
wheel, but, presumably, with the vehicle halted. As a corollary to this type
of method, fatigue might be measured by increasing a driver’s work load by
engaging him in a secondary task while driving: an example of this technique
which required drivers to add digits or to listen to a string of numbers and
notice when there was a repetition has been reported by Brown and Poulton
(1961) as a test of driving stress, although they did not attempt to use it to
measure fatigue. When more than usual attention is required for driving, the
error score on the secondary task may be expected to increase; Brown and
Poulton found their task to be sensitive to traffic, and it is possible that a
similar arrangement or an added secondary task would be sensitive to fatigue.
However, it might serve to reduce fatigue by contributing an added interest
to the task.

It is generally accepted that with the onset of fatigue the apparently less
important aspects of a task will be sacrificed to keep the load of the task within 
the subject’s capabilities. Thus among sedentary workers in their day-to-day 
existence the social courtesies are sacrificed first. In driving, these courtesies 
take the form of giving way to other drivers and to pedestrians, giving clear 
signals and, at night, dipping headlights. Reduction in these courtesies and 
increase of activities such as impatient horn blowing may be expected after 
long hours of driving. They may perhaps be used as measures of fatigue. 
Headlight dipping and horn blowing could be recorded automatically and on 
a large scale with simple apparatus and no-one in attendance. They have the
advantage of not changing the task from driving to a special test, nor adding to 
it nor interfering in any way with comfort and normal driving habits. Such 
tests might, however, be difficult to arrange as it might not be possible to 
control the conditions and the measures would lose some sensitivity, and 
perhaps not show the effect at all, if the drivers had the added interest of 
knowing they were being tested. 


§ 5. CONCLUSIONS 

Fatigue resulting from prolonged activity at one task may be transferred 
to and measured by another. There are therefore two aspects to driving and 
fatigue. First it must be asked whether, and to what extent, driving leads to 
fatigue and secondly how much fatigue affects driving. It is entirely possible 
that driving creates a great deal of fatigue but, fortunately, is not very sensitive 
to it—or conversely that driving performance is very sensitive to fatigue but 
that fatigue from driving is limited. Research is required to distinguish 
between these possibilities and the results will indicate future research and
administrative policy. At present a major difficulty in the way of such research 
is the lack of knowledge both about driving performance and about fatigue. 
Experimental techniques which do not yet exist or are only now being developed 
are required to study these basic problems. 

It seems that the type of fatigue arising from driving is very different from 
that produced by physical exercise, and that it is caused largely by the stresses 
arising from traffic and other conditions, which produce varying states of 
emotional arousal. Repeated emotional arousal over a short period results in 
oversensitive behaviour showing strong responses to slight irritations, and if 
the period is prolonged, this is followed by a state of ‘lowered vigour’ in 
which there is a reduction in intensity of response to the environment and a 
raised threshold of arousal. Either of these states is expected to increase the 
risk of accident both to the subject and to other drivers. 


This paper is published by permission of the Director of Road Research. 


Le problème de la fatigue du conducteur de voiture revêt un double aspect. D’une part,
il y a la fatigue due au fait de conduire, d’autre part, il y a les effets de la fatigue, d’où qu’elle
vienne, sur la conduite. L’étude de la fatigue chez les conducteurs est rendue difficile par
l’absence actuelle d’une méthode fidèle d’évaluation du degré de fatigue et de critères de la
performance lors de la conduite.

Cet article rapporte un certain nombre de recherches concernant la fatigue opérationnelle et
fait porter la discussion sur l’origine de la fatigue chez le conducteur. Il apparaît, en conclusion,
que le stress est un facteur important dans la genèse de la fatigue du conducteur. On fait


pice critique des diverses méthodes qui permettent d’évaluer les effets de la charge émotive 
du stress.

Das Problem der Ermüdung beim Kraftfahrer ist zweifach. Es umfasst sowohl die Ermüdung
durch die Tätigkeit des Fahrens, als auch ermüdende Wirkungen irgend welcher anderen Herkunft
auf das Fahren. Das Studium der Ermüdung bei Kraftfahrern ist dadurch erschwert, dass
weder eine zuverlässige Methode der Ermüdungsmessung noch auch eine brauchbare Definition
der Leistung des Kraftfahrers existiert. Die vorliegende Arbeit sichtet einen Teil der Untersuchungen
über Arbeits-Ermüdung und diskutiert die Ursachen der Ermüdung beim Kraftfahren.
Es wird angenommen, dass Belastung (“stress”) eine wichtige Ursache der Fahr-Ermüdung ist.
Verschiedene Methoden zur Feststellung der emotionellen Wirkung, die durch “stress”
hervorgerufen wird, werden besprochen.


SOME EXPERIMENTAL STUDIES OF RADAR OPERATING


By D. WALLIS and J. A. SAMUEL


Division of the Senior Psychologist, 
Manpower Department, Admiralty 


An account is given of experiments in which several critical features of the 
radar operator’s task were examined. Experienced operators were used in each 
case, and special simulators were constructed to give realistic but controllable
presentations. 

Throughout the first study, subjects were required to undertake an auditory 
detection task at the same time as maintaining a visual search on the radar display. 
Each experimental watch lasted for 3 hours and was preceded and followed by a 
measurement of ‘optimal’ performance under psycho-physical test conditions. 
Results were analysed to compare the effects of ‘ continuous’ and ‘ interrupted ’ 
radar operating, and to show how performance on watch is related to ‘ optimal ’ 
standards. Trends throughout a single session, and throughout five successive 
daily or weekly ones, were examined. 

The second experiment was designed to assess the influence of target density 
(the number of contacts displayed simultaneously) and of blip-scan ratio (proba- 
bility of ‘ paint’) upon the detection of a new contact. Experimental sessions 
lasted for 30 minutes. An interaction between the two variables was established. 

A third investigation is reported in which the eye-movement patterns employed
in radar search were recorded by the electro-oculographic method. This was
an exploratory study only, but it revealed consistent evidence of unsystematic 
visual scanning, with wide differences between patterns used by the operators. 

The overall results and their implications for radar operating are discussed.


§ 1. INTRODUCTION 


THE primary function of a radar operator is to detect and identify signals 
which reveal the location of critical ‘contacts’. A characteristic form of 
radar display is the plan-position-indicator (P.P.I.). It employs the circular 
face of a cathode-ray-tube to give map-like representation of the search area 
swept by a radar antenna. Signals appear as bright spots ‘ painted’ on at 
the appropriate range and azimuth by a rotating radial line of light; they are 
likely to be masked by ‘ noise’ or ‘ clutter’ (e.g. returns from rain or land). 
The whole display brightens as the radial line revolves, but it gradually fades 
away until restored by the subsequent revolution. 

Psycho-physical relationships between display variables and detection 
thresholds have been extensively studied under laboratory conditions. 
(Morgan 1952, has summarised the earlier work.) Necessary though such 
data are, however, they rarely permit more than the prediction of optimal 
performance, as defined by the operator who is fresh, alert, and fully aware 
of when and where a signal will appear on his display. Unfortunately, this 
gives at best an incomplete picture of what happens under real operating 
conditions. Since the pioneering studies of Anderson (1944) and Mackworth 
(1950), more attention has been devoted to the examination of trends in per- 
formance when operators are on watch for long periods and cannot anticipate 
the occurrence or location of radar contacts. 

An account is given in the present paper of some experiments which sought 
to throw more light on the nature of radar operating, when it is practised
under more realistic conditions than have usually been reported.


§ 2. PERFORMANCE DURING LONG WATCHES


THE first project was occasioned by a practical requirement to maintain a 
continuous watch on an airborne radar for several hours on end. At the same 
time a listening watch on a Direction-Finding (D.F.) system had to be main- 
tained. Two operators, one officer and one rating, were available in the 
aircraft, but in separate cockpits. Each man could be supplied with a 5 in. 
P.P.I. display from the search radar, and with a headset to receive audible
D.F. signals. Brief interruptions of his radar watch had to be permitted for 
one or other of the operators, to accommodate regular navigational computa- 
tions. No break in the listening watch was permissible, however. 

In this situation each man was faced with two simultaneous perceptual 
tasks. He scanned his P.P.I. for radar signals, and at the same time listened 
for characteristic audible signals. The two channels of information were 
entirely uncorrelated, and the signals in either one would be infrequent and 
noise-masked. 

A full-scale mock-up of the aircraft cockpit was constructed and equipped 
with apparatus to simulate the radar and the D.F. listening channel. Radar 
echoes (2 mm diameter) were introduced against a controlled but changing
background of noise and clutter; appropriate brightening and decay character- 
istics for the radial time-base and signals were shown, to accord with an 
antenna rotation rate of about 40 r.p.m. The auditory signals consisted
each of 3 pure-tone pulses at 835 c/s in succession, each pulse lasting for 
20 m/secs and separated from the others by 1} sec intervals. They were 
masked by a ‘ white noise’ background which was fed continuously into the 
subject’s headphones at an overall sound pressure of 75 dB re 0·0002 dyne/cm².

Within the cockpit, the ambient temperature and noise intensity were 
respectively held constant within ±3° of 62°F (dry-bulb) and ±2 dB of
66 dB re 0·0002 dyne/cm². Cockpit illumination was not measured in
absolute units, but was maintained by means of external lighting at a constant 
subdued ‘ twilight ’ level. 


2.1. Experimental Procedure


A high level of motivation was encouraged by means of an explanatory 
‘hand-out’ and a personal interview of each subject before the experiments 
began. Radar and D.F. signals were individually demonstrated, and 5-10
min practice was given, with knowledge of results. Instructions emphasised
the need for continual alertness on watch, and that a response button placed 
below the P.P.I. display was to be pressed immediately a fresh radar or auditory 
signal was detected. 

The design of experiments was built around a 3-hour watchkeeping session
in the cockpit, during which a pre-arranged programme of signals unfolded
for each subject. There were always 10 separate radar signals and a further
10 auditory signals during each hour, dispersed irregularly and introduced
without warning; signals never occurred simultaneously on both channels.
Radar signals were repeatedly ‘ painted’ on each rotation of the radial line
until they were perceived. Auditory signals were introduced at an intensity
below the threshold for detection in noise and increased by 1 dB steps every
4 secs until responded to. A confirmatory verbal report was required after
the initial detection response; in the case of a radar echo, this report had to
specify the spatial location of the displayed signal. 
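
The ascending presentation of the auditory signals amounts to a simple staircase, which may be sketched as follows. The starting level, the response times and the function names are hypothetical; only the 1 dB step and the 4 sec interval are taken from the procedure described above.

```python
def level_at(start_db, elapsed_s, step_db=1.0, interval_s=4.0):
    """Signal level (dB relative to an arbitrary reference) after
    `elapsed_s` seconds of an ascending presentation that rises by
    `step_db` at the end of every `interval_s` seconds."""
    if elapsed_s < 0:
        raise ValueError("elapsed time must be non-negative")
    completed_steps = int(elapsed_s // interval_s)
    return start_db + completed_steps * step_db


def mean_detection_level(start_db, response_times_s):
    """Average level at which a series of signals was first responded to."""
    if not response_times_s:
        raise ValueError("at least one response time is required")
    levels = [level_at(start_db, t) for t in response_times_s]
    return sum(levels) / len(levels)


# Hypothetical example: each signal starts 10 dB below some reference level,
# and the subject responds 4 sec and 12 sec after onset on two occasions.
print(level_at(-10.0, 9.0))                      # -8.0
print(mean_detection_level(-10.0, [4.0, 12.0]))  # -8.0
```

The time taken to respond thus converts directly into the intensity at which the signal was detected, which is the quantity reported in the results.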

Subjects always attended in full flying kit, which included the standard 
aircrew headphones and microphone for the communication of reports and 
instructions to and from the experimenters. A short period of familiarisation 
with the procedure was allowed before the first experimental session, whilst 
every session was preceded by 5 min of formal practice. 

Immediately after this practice, measurements were taken of the subject’s 
‘optimal’ performance. For radar operating, this was defined as the average 
response time to 10 successive radar echoes presented with a prior alerting 
signal. Optimal listening performance was measured as the average signal- 
to-noise ratio at which 10 successive audible signals were just perceived, 
again with a warning before each signal occurred. Similar measurements of 
‘optimal’ detection were repeated immediately after each 3-hour session 
ended. 

For the first experiment a group of 10 operators experienced two complete 
sessions in the cockpit. During one session both radar operating and the D.F. 
listening watch were entirely continuous. In the other session radar operation 
was interrupted for regular and specified periods totalling 15 min during each 
of the three hours. Exercises in navigational computation and plotting were 
taken up and continued in the cockpit throughout these breaks, whilst the 
subject tried to maintain his listening watch. Table 1 shows how the signals 
were deployed during these ‘ interrupted’ radar sessions. 


Table 1. Plan of each Successive Hour During a 3-hour Session (‘Interrupted’ Radar Watch)


Serial Minute Number    Operating Tasks                     Number of Signals
0-5                     D.F. listening + Navigation         1 D.F.
6-20                    D.F. listening + Radar Operating    2 D.F. + 4 Radar
21-30                   D.F. listening + Navigation         2 D.F.
31-60                   D.F. listening + Radar Operating    5 D.F. + 6 Radar


The second experiment continued with the same subjects, each of whom 
now underwent five sessions of ‘ interrupted’ radar. An interval of one week 
separated each session. Finally a third experiment was conducted with the 
remaining 12 subjects, who were given five sessions separated by one day only. 


2.2. Results 


Only the main findings will be reported here. Details of the breakdown 
of data and statistical analyses have been given by Wallis & Samuel (1956). 
So far as radar operating is concerned, the experiments allowed a comparison 
between continuous and interrupted working, and an assessment of trends in 
performance under the latter condition throughout successive daily or weekly 
spells of operation.

Taking the session as a whole, radar performance was significantly inferior
when it was continued without a break. Moreover, the subjects were less
efficient during the second hour than during the first; they recovered partially
during the final hour, but only at the expense of a corresponding decline from
the level of efficiency at which their simultaneous listening watch had been 
maintained. The only surprising feature of the downward trend in radar 
detection was that it did not appear till after an hour. This is much later 
than was expected on the basis of earlier findings from laboratory studies of 
vigilance (Mackworth 1950). However, the stress of continuous radar operation 
produced an adverse side-effect, since the average intensity at which the D.F. 
signals were just perceived was significantly higher throughout each successive 
hour, among the subjects who scanned their displays without a break. 

During the second and third series of experiments, when the radar operating 
was always ‘interrupted’, the advantages of this mode of operation were 
clearly confirmed. With one exception, various analyses of the results within 
and between sessions failed to reveal any trends, either in average level or 
variability of response. The only sign of accumulating stress was an increased 
variability of detection time among those operators who had completed three 
successive daily sessions. The removal of any tendency towards a progressive 
decline in operating standards as a watch proceeded, was anticipated on the 
basis of Mackworth’s results. It confirmed the advisability of breaking up a 
radar watch into periods not exceeding one hour. Even so, it must be noted 
that occasionally there were brief but severe lapses of attention which led to 
abnormally long delays before a radar signal was detected. These lapses 
were just as liable to occur early in the session as later, as is evident from 
Figure 1.


Figure 1. Detection of echoes: performance through ‘ interrupted ’ watches of 3 hours duration.
(Data from 12 subjects, averaged over 5 daily watches.) 


It is interesting to note that the efficiency of the auditory D.F. watch 
paralleled that of the radar watch in all these later experiments. Not only 
were there no signs whatever of any changes in the average detection thresholds 
for auditory signals, but individual consistency was markedly high. Only 
rarely did any operator fail to detect a signal at an intensity within ±2 dB of
his own average response level. Reports from the subjects alleged that,
subjectively, they were able to give all their attention to radar operating and
could respond quite ‘ automatically’ to the sudden intrusion of a relevant
auditory stimulus. 

Immediately before and after each prolonged watch, measurements had 
been taken of how well the subject could detect when he was warned that a 
signal was imminent. These pre-watch and post-watch indices of ‘ optimal ’ 
performance did not differ significantly. It was clear, therefore, that the 
intervening ordeal of three hours in the cockpit induced no ‘ fatigue ’ of sensory 
functions. Yet among all subjects there was a marked fall-off in performance 
from this optimal level as soon as the proper session began; i.e., although the 
radar operators maintained a constant average level of performance throughout 
successive hours, they never managed to perform as well as they were demon- 
strably capable of doing when specifically alerted. The results are shown in 
Figure 2.


Figure 2. Comparison between optimal performance and detection during ‘ interrupted ’ watches.
(Data from 10 subjects). 


This is an effect which has been noted in subsequent radar experiments, 
even with much shorter periods of operation. It occurs no matter how 
conscientiously the operator persists at his task, and seems to be related to 
the irregularity, infrequency and unpredictability with which signals actually
appear. The same limitation, first demonstrated by K. Elliott at the Admiralty 
Research Laboratory (Elliott 1959) shows up in auditory watchkeeping tasks; 
as indeed it did in the synthetic D.F. listening watch employed in our experiments.

Another characteristic was revealed by analysing the responses to echoes 
displayed in various parts of the P.P.I. Several investigators have noted that 
some parts of the display tend to be ‘ favoured ’, to the detriment of an efficient 
search when there is an equal probability of echoes appearing anywhere
(Baker 1958). We found that detection was noticeably less reliable at long
ranges, where a larger area for visual scanning is imposed by the P.P.I. form
of display. This, as shown in Figure 3, was so even when the operator was
fresh and alert, as in the pre-watch tests mentioned above; and it was aggravated
during normal search conditions. It suggests that the visual scanning
patterns of radar operators may impart a bias against peripheral detections
on a P.P.I., and that as time on watch increases, the bias is accentuated.


Figure 3. Detection in various display sectors and range bands. (Data from alerted subjects
i.e. from pre-watch tests, averaged over 5 weekly sessions). 

Finally, it should be emphasised that wide individual differences were 
demonstrated among operators. As groups, officers and ratings were not
distinguishable; which is not surprising, since all the subjects seemed to be
well-motivated during the experiments, and the operating tasks were easily
comprehended. Yet although these subjects were all trained in radar and were
well practised at the experimental task, they varied markedly and consistently
from one to another within their respective groups. Significant positive 
correlations (r = +0.72, p < 0.05 and r = +0.70, p < 0.01) were obtained 
for each group of subjects between performance under ‘ optimal’ and under 
‘watchkeeping ’ conditions. Taken together with other internal evidence, 
this suggests that the differences between operators during actual watchkeeping 
sessions may have been attributable to differences in ability to search the 
display effectively, rather than to differences in general alertness. (Even under 
the ‘ optimal ’ conditions of the pre-watch measurements described previously, 
the subjects had still to locate the radar signals on their display.) 


§ 3. RADAR REPORTING ON NUMEROUS CONTACTS 


If the radar display is regarded as imposing a mental ‘ load ’ on the operator, 
then clearly this load will vary within wide limits, according to operational 
circumstances. At one extreme there may be a very low incidence of ‘ relevant’ 
signals, against a changing but perceptibly-distinct background of noise and 
other masking phenomena. Here the operator’s load may seem to be minimal; 
yet the task is exacting, since it demands constant vigilance, systematic 
searching of the display, and the ability to discriminate an unpredictable 
target at the lowest possible signal-to-noise ratio. The investigation just 
described was the first of several into this ‘ search’ phase of radar operating. 

At the other extreme the operator’s load is qualitatively as well as 
quantitatively different. When there are many relevant targets already 
displayed, and others are expected, the maintenance of alertness presents 
little difficulty. Identifying a new target, however, becomes progressively 
harder for the operator; for he has to remember the whereabouts of ‘old’ 
targets and go on watching them, besides searching for new ones. 

The following experiment was undertaken to estimate how great a load, in 
terms of target density (the number of targets painting simultaneously), lies 
within the capacity of a skilled operator. In practice, of course, target 
detection and identification are more difficult if probabilities of paint are low. 
For this reason, blip-scan ratio as well as target density was systematically 
varied. (The blip-scan ratio expresses the probability of a contact ‘ painting ’ 
on the display; empirically, it is found by dividing the number of successive 
antenna rotations into the number of occasions when a ‘ paint’ occurred.) 
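
The empirical definition just given can be put as a short calculation. The following is a minimal sketch only: the function name and the sample record of paints are illustrative assumptions and are not data from the experiment.

```python
def blip_scan_ratio(paints):
    """Fraction of successive antenna rotations on which the contact painted.

    `paints` holds one boolean per rotation (True = a paint occurred).
    """
    scans = len(paints)
    if scans == 0:
        raise ValueError("at least one antenna rotation is required")
    return sum(1 for painted in paints if painted) / scans


# Hypothetical record: 6 paints in 8 successive rotations.
history = [True, False, True, True, False, True, True, True]
print(blip_scan_ratio(history))  # 0.75
```

A contact which paints on every rotation gives a ratio of 1.00, the most favourable of the three values used in the experiment.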

A new simulator was constructed to provide a controllable but realistic 
radar presentation, this time with a data rate of 6 r.p.m. Six experienced operators 
acted as subjects, using a 5 in. P.P.I. display. Each man carried out nine 
experimental watches of 30 min duration, the watches being taken on successive 
days. During the first 5 min the radar picture built up to one of the three 
conditions of target density, which prescribed 5, 10, or 15 radar echoes to be 
showing throughout the watch. A further 10 new targets appeared singly 
at random intervals during the remaining 25 min. To preserve the required 
target density one of the existing targets would fade just after a new one 
appeared. All targets tracked slowly across the display. Three different 
blip-scan ratios, of 0.5, 0.75, and 1.00, were employed, so that each subject 
experienced every combination of target density and blip-scan ratio. 
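
The layout of the nine watches follows directly from crossing the two variables. The sketch below simply enumerates the combinations and the number of antenna rotations in one watch; the variable names are illustrative, and the order in which the conditions were actually presented to each subject is not stated here and is not implied by the listing.

```python
from itertools import product

TARGET_DENSITIES = (5, 10, 15)       # echoes showing throughout the watch
BLIP_SCAN_RATIOS = (0.5, 0.75, 1.0)  # probability of a contact painting

# One watch for every combination of target density and blip-scan ratio.
conditions = list(product(TARGET_DENSITIES, BLIP_SCAN_RATIOS))

WATCH_MIN = 30       # duration of one watch, in minutes
DATA_RATE_RPM = 6    # antenna rotations per minute
rotations_per_watch = WATCH_MIN * DATA_RATE_RPM

print(len(conditions))       # 9
print(rotations_per_watch)   # 180
for density, ratio in conditions:
    print(density, ratio)
```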

The operating task was to press a response button as soon as a ‘new’ 
target appeared, confirming the detection by a verbal report of its range and 
bearing. All target echoes were similar in appearance; they were about 1½ mm 
in diameter and just bright enough to be clearly visible when directly pointed 
out. 


3.1. Results 


It was known from several unpublished reports that the probability 
of detecting radar targets was related directly to the size of the 
blip-scan ratio and inversely to the target density. In practice, both 
variables must be considered together, and it was suspected that the interaction 
between them may lead to a critical breakdown in effective operating. 

Results were analysed with the object of revealing such an effect if it were 
present, using the criteria of completely undetected targets and ‘false’ 
detections (i.e. already-existing contacts which were incorrectly reported as new 
ones), as well as response times for correct detection, to assess operating 
efficiency. Fig. 4 illustrates the relations between the two variables and the 
response times for targets correctly identified as new detections. There 
certainly appears to be a trend for increased target densities and lower blip-scan 
ratios to degrade performance, and an interaction between these variables 
shows up clearly. Nevertheless, only the interaction proved to be statistically 
significant; a sharp decline in performance occurred only when the target 
density was more than 10 and the blip-scan ratio less than 0.75. 
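
Stated as a rule, the interaction picks out a single cell of the experimental design. The sketch below is illustrative only: the function name is an assumption, and the thresholds are merely those quoted in the preceding sentence, not a fitted model.

```python
from itertools import product


def in_critical_region(target_density, blip_scan_ratio):
    """True where the sharp decline in response time was observed."""
    return target_density > 10 and blip_scan_ratio < 0.75


conditions = product((5, 10, 15), (0.5, 0.75, 1.0))
critical = [c for c in conditions if in_critical_region(*c)]
print(critical)  # [(15, 0.5)]
```

Of the nine combinations tested, only a density of 15 with a blip-scan ratio of 0.5 satisfies both conditions.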

Changes in blip-scan ratio, considered independently, had no effect upon 
omissions or false detections, whereas both these measures increased significantly 
at the highest target density. No interaction between the variables appeared 
here. The overall conclusion was that the efficiency of radar operating can 
be seriously jeopardised, particularly at low blip-scan ratio, if the operator 
is forced to discriminate among more than 10 simultaneous contacts. This 
was confirmed in later experiments where fast-moving aircraft targets were 
presented on a P.P.I. display at a data rate of 15 r.p.m. 

Figure 4. Effect of target density and blip-scan ratio upon detection of new targets. 

A final effect which should be noted was the significant individual differences 
among these subjects, highly-skilled and experienced though they were, in 
the speed with which they could discern new targets. All subjects were prone 
to make false detections occasionally. 


§ 4. SOME IMPLICATIONS FOR RADAR OPERATING 


Before going on to mention work which is still in progress, let us sum up 
the main implications of these two investigations. Subsequent field trials 
using actual radar equipments under realistic conditions of operating, have 
produced results which support the following generalisations. 


4.1. Characteristics of Watchkeeping Performance 


It has become a classical doctrine that operators cannot remain alert 
during a surveillance watch, that they decline steadily, and that half-an-hour 
is as long as they should remain on duty without a break. Whilst there is 
some support for this view, the picture is really much more complex, and on the 
whole much less pessimistic. 

In the first place, one must distinguish between ‘low’ and ‘high’ load 
conditions of reporting. Paradoxically, it is when the radar activity is minimal 
that efficiency is most likely to suffer. When the relevant signals are infrequent 
and cannot be precisely anticipated, which is characteristic of ‘early warning’ 
and search phases of operation, circumstances are certainly unfavourable to 
human operators. They will tend always, even from the start of a brief watch, 
to respond less efficiently and consistently than might be predicted from a 
knowledge of how well they can detect signals which they can anticipate in 
time. 

There is more to this than the simple assumption that operators ‘ doze off’ 
or are irresponsible. Unless alleviating measures are taken, the effect is present 
even though the personnel are well trained and sincerely striving to keep a 
vigilant watch. Fundamentally, it is explicable in terms of the psychological 
need which all human beings have for a ‘ lively ’, perpetually-variable environ- 
ment in order to conserve a high level of ‘ arousal’ (Bexton, Heron & Scott 
1954) in the central nervous system. In the absence of variable stimulation 
from the physical surrounding, the sensory and nervous mechanisms are 
inevitably depressed. This is one reason why it is feasible to improve radar 
performance by introducing a less monotonous, more active, and perceptually 
more viable environment for radar operating than is often met with in practice. 
Experiments in Canada (Smith & Boyes 1957), confirmed in most respects by 
one of our colleagues (N. S. Kirk) with Service equipment, have shown that 
a moderately high level of illumination in radar compartments, together with 
careful setting up of displays, is beneficial to efficient reporting. Other measures, 
too, are practicable for creating a stimulating environment in which to operate. 

An associated factor in ‘low load’ conditions is the apparent regulation of 
performance by the frequency with which relevant contacts appear on the 
display. It is as though the operator’s readiness to respond was conditioned 
by an estimate of the likelihood that anything will in fact be presented. This 
subjective ‘expectancy’ is presumably based on experience from the onset 
of a watch, and will be least when the actual frequency of signals is minimal. 
Various investigators have shown that the efficiency of detection can be raised 
by artificially raising the signal rate, i.e. by periodically injecting ‘false’ 
signals (Deese & Ormond 1953). 

An experimental system based on this idea was devised and tried out here 
(Wallis & Newton 1957), but in the form used its results were insufficiently 
promising to warrant its development. Nevertheless, in the light of knowledge 
gained in this study, it seems likely that a substantial aid to surveillance 
operations could be produced, in which artificial signals are added whenever 
‘real’ activity is excessively low. Precautions against the consequences of 
false alarms could certainly be built in. (It may be pointed out here that 
operators will produce a remarkably sensitive performance if they are en- 
couraged to report anything which conceivably might be a contact, without 
waiting for the confirmation of successive ‘paints’. This is tantamount, of 
course, to an artificial raising of the signal frequency, for it increases the 
probability of detecting real signals at the cost of an increase in ‘false’ detections.) 

So far as ‘high-load’ operating is concerned, we must recognize that 
although some continuous activity on the display is beneficial, there are upper 
limits to the amount of information with which an operator can deal. A 
number of variables influence these limits, so that one cannot generalize with 
any confidence. Circumstances which increase the irrelevant information, 
such as the presence of jamming and clutter, will naturally be detrimental. 
So will those which reduce the relevant information, such as excessively low 
blip-scan ratios and signal-to-noise ratios. In any event the target density 
will be a limiting factor, and it is safe to assume that operators will be unlikely 
to cope effectively if there are as many as 10 contacts to be watched simul- 
taneously on the display. 

Finally, it is probable that ‘fatigue’ during a watch, in so far as this is 
manifested by a progressive decline in efficiency, is not a serious problem. It 
can be circumvented altogether by allowing a break in continuous operation 
after about half-an-hour, provided that the intervening period is occupied in a 
quite different kind of activity. Should it be operationally necessary, a watch 
which includes 10 min breaks at these intervals could be prolonged for three 
hours, though this is definitely not to be recommended as general practice! 
We have confirmed several times that an hour’s continuous watch can be 
maintained without a steady decline in performance when contacts are not 
less frequent than one every 10 min. 


4.2. Individual Differences in Skill 


Perhaps the most significant findings are those showing the marked varia- 
bility which exists even among trained and experienced operators. This 
variability cannot be attributed to differences in visual acuity or general 
intelligence, at least among Service operators. It seems likely that personality 
factors may be influential, since radar operators certainly vary in the strength 
of their liking for the work and in the overt signs of stress or relaxation which 
they display on watch. There is a hint in Broadbent’s (1958) account of 
vigilance research that the personality dimension of introversion-extraversion 
may be related to success in ‘ monitoring’ occupations; but this has not so 
far been confirmed for radar operation. 

Most of the research which is relevant to individual differences in radar 
operating has stemmed from the problems of training personnel in the percep- 
tual skills involved. These training problems cannot be solved without a 
clear understanding of the radar operator’s skill, one element of which seems 
to be the ability to carry out a systematic search of the visual display. The 
last part of the present paper will be concerned with a preliminary study of 
this factor. 


§ 5. EYE-MOVEMENTS DURING RADAR SEARCH 


Human beings tend to vary more among themselves at visual tasks than 
at auditory ones; and they vary still more when the visual task contains a 
large element of search, as in scanning a radar display. Earlier in this paper 
it was pointed out that the ‘ order of merit ’ produced by individual differences 
in the efficiency of prolonged radar watchkeeping, is substantially the same as 
that produced if the operators know when a target is about to appear although 
they do not know where to look for it on the display. These findings suggest 
that some of the variability between operators is directly attributable to varying 
degrees of skill in scanning the display. 

The data illustrated in Fig. 3 support the contention that some parts of 
a P.P.I. display tend to be ‘ favoured ’, even though the appearance of signals 
is equiprobable anywhere. Here is further evidence that patterns of visual 
scanning may incorporate undesirable biases away from a theoretically-optimal 
search, and may indeed be a cause of the surprising irregularities in 
the level of efficiency which appear from time to time during a watch. Un- 
fortunately, explicit training in how to scan a display is not at present practicable, 
especially in relation to prolonged periods of systematic search, since no-one 
knows which methods are best or what operators are capable of. 

Many operators seem to devise their own procedures. The majority 
claim that their usual method on P.P.I.s is to search just behind the radial 
time-base, but at data rates exceeding about 30 r.p.m. this becomes impossible. 
The subjects employed in our experiments made varied assertions about the 
patterns of search adopted, depending on the size of display and the data rate. 
The reliability of many of these claims may be doubted; but until recently 
there was no sufficiently-sensitive means of recording fixations and eye- 
movements, with which to examine the matter objectively. 

However, several promising techniques have now been developed, one of 
which involves the use of equipment known as the Eye Movement Measuring 
Apparatus (Shackel et al. 1958). This equipment, referred to as E.M.M.A., has 
been used in conjunction with the radar simulator devised for target density 
studies, to make preliminary observations of eye-movements during radar 
scanning (Samuel 1958). The method, known as electro-oculography, involves 
the pick-up and amplification of changes in potential difference between the 
front and rear of the eyeball, changes which occur as the eye-ball moves. 
With the E.M.M.A. equipment it is possible to observe fixations, and 
movements of the eyes as the point of visual fixation varies, in terms of 
the displacements of an oscilloscope spot. This display can be photographed 
at the same time as a monitoring display of the P.P.I. to which the experimental 
subject is attending. 

A group of experienced operators have been ‘monitored’ during 30 min 
watches on a P.P.I. display. Their task was to search for new ‘pips’ appearing 
among those already present. A data rate of six per min was employed and 
the diameter of the display was 5 in. From the viewing distance of 17 in., the 
maximum angular deflection of the point of visual fixation was 17° across any 
diameter of the tube face. Certain restrictions were imposed on the operator 
by the use of electrode fittings near the eyes, but it was established that these 
did not detract from his ability to detect. (It is not verifiable, of course, that 
his eye-movements were entirely unaffected.) 
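
The quoted angle of 17° can be checked from the display diameter and viewing distance. The following sketch assumes that the eye lies on the axis through the centre of the tube face; the function name is illustrative.

```python
import math


def full_visual_angle_deg(extent, viewing_distance):
    """Angle subtended at the eye by an extent centred on the line of sight."""
    return math.degrees(2 * math.atan((extent / 2) / viewing_distance))


# 5 in. display diameter viewed from 17 in.
angle = full_visual_angle_deg(5.0, 17.0)
print(round(angle, 1))  # 16.7, i.e. about 17 degrees across a diameter

# At six rotations per minute each sweep of the time-base takes 10 sec.
sweep_period_s = 60 / 6
print(sweep_period_s)   # 10.0
```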


This investigation showed : 

(i) There were wide differences between operators in the way they used 
their eyes to search the display, although a basically-circular pattern 
predominated. 

(ii) Not only was there a general tendency to view the inner portion of 
the P.P.I. more closely than was appropriate—but a considerable 
proportion of the time was spent looking away from the display alto- 
gether. 

Figure 5 gives a qualitative picture of what occurred in practice. It records 
the movements of one operator’s eyes during six successive sweeps of the rotating 
time-base. Each ‘ frame’ therefore accounts for a 10 sec period, during which 
the eyes moved between all the points connected. The order in which these 
movements occurred is not shown. The large number of movements certainly 
implies that the operator was trying hard to find a target, which in the present 
case he seemed to expect somewhere in the upper half of the display. However, 
it is most unlikely that so intensive a search could be maintained for more 
than a few minutes. Nor is it likely that the more-or-less random nature of 
successive fixations afforded the most efficient or economic search pattern. 

Figure 5. Movements of the eyes during six successive rotations of radar scan. 
(The larger black dots, e.g. at 4 o’clock on frame 5, show where echoes painted.) 

We cite this as only one example of the recorded patterns. Operators 
vary even with the same display; we suspect that display size, data rate, and 
time on watch, will all add to the variability. Rather than pursue the imposs- 
ible path of recording what happens in all circumstances, it is necessary to go 
back to the beginning, as it were, to investigate just how effectively the eye 
can detect marginal signals when these are displayed at varying angular 
deflections from the point of foveal regard (i.e. fixation point). This research 
is being conducted by our colleague N. S. Kirk. It is hoped to use the results 
as a basis for predicting what the most economic and efficient search patterns 
should be for different circumstances. Such information, if it should prove to 
be reliable in subsequent experimental studies, should be of great assistance 
not only to designers of future displays but also for the training of operators. 


The two main experiments referred to here were carried out when the 
authors were attached to the Admiralty Research Laboratory, Teddington. 
Members of the Naval Motion Study Unit who took part included Messrs. S. J. 
Murch, W. D. Laurie and C. McCarthy (now with the Fleet Work Study Organ- 
ization). Sincere thanks are due also to the Naval Officers and Ratings who 
acted as subjects in the various experiments. 

The views expressed in this paper are those of the authors, and should not 
be taken as representing the official opinion of the Admiralty. 


This article reports experiments in which several critical characteristics of the radar 
operator’s work were studied. In each experiment trained operators were used, and special 
simulators were built to provide realistic but controllable task conditions. 

In the first experiment the subjects were required to carry out an auditory detection task 
while visually searching the radar screen. Each experimental watch lasted 3 hours and was 
preceded and followed by a measurement of ‘optimal’ performance in psycho-physical tests. 
The analysis of the results compares the effects of ‘continuous’ and ‘interrupted’ work and 
shows how performance at a watchkeeping task can be related to ‘optimal’ standards. Account 
was taken of the trend within a single session, as well as over five consecutive days or weeks. 

The second experiment was designed to study the effect of signal density (the number of 
contacts presented simultaneously) and of blip-scan ratio (the probability of a ‘paint’) 
on the detection of a new contact. A session lasted 30 min. An interaction between the two 
variables was demonstrated. 

In a third experiment the pattern of eye-movements used in searching the radar screen was 
studied by an electro-oculographic method. Although this was only a preliminary study, it 
showed very clearly an unsystematic visual search, with large differences between the 
patterns used by the operators. 

The discussion deals with the general results and with their application to radar operating. 


Experiments are reported in which some critical features of the radar observer’s task 
were investigated. In all cases experienced observers were used, and special simulators 
were constructed to give realistic but controllable presentations. 

In the first experiment the subjects had to detect acoustic signals while visually searching 
the radar screen. Each experimental ‘watch’ lasted 3 hours. Before and afterwards, ‘optimal’ 
performance was measured under psycho-physical test conditions. The results were analysed 
to compare the effects of ‘continuous’ and ‘interrupted’ radar observation and to show how 
performance during the watch relates to the optimal test standards. Trends over a single 
session, or over successive daily or weekly sessions, were studied. 

The second experiment was intended to determine the influence of ‘target density’ (the number 
of contacts shown simultaneously) and of the probability of a contact appearing upon the 
detection of a new contact. The experimental sessions lasted 30 minutes. An interaction 
between the two variables was demonstrated. 

A third investigation is reported in which the patterns of eye-movement used in radar 
observation were recorded by electro-oculographic methods. This was only a preliminary 
study, but it yielded weighty evidence of unsystematic visual scanning, with large 
differences between the methods used by the observers. 

The overall results and their implications for radar personnel are discussed. 


SOME CHARACTERISTICS OF INSTRUMENT MEASURING 
DIALS 


By ANTOINE PAPALOIZOS 


Department of Industrial Psychology, Ebauches, S.A., Neuchatel, Switzerland 


Two experiments were undertaken to evaluate the legibility of dials. The 
results of both experiments showed that one factor was extremely important for 
correct and quick reading : that the major graduations (the 10s and 5s) on the one 
hand and the dial + minor graduations on the other should be in a relationship of 
figure and ground. The special case of omitting minor graduations from the scale 
is also examined. 


THIS paper reports two experiments which were undertaken to determine the 
importance of different characteristics of dials with respect to their legibility. 
The field of dial design has been reviewed recently by several authors (Chambers 
1956, Murrell 1957, Tufts College 1952, Woodson 1956) and a large number of 
contributions have been made. The relevant variables are, however, many 
and there is still room for a great deal of further work. 


63,2 mm 


Figure 1. General view of the dial. 


The dials with which the present study was concerned were to be fitted 
to a comparator of predetermined shape and dimensions shown in Fig. 1. 
These two factors were therefore given and could not be changed. Although 
in consequence restricted in scope, the study is believed to confirm and, in 
certain respects, to amplify previous knowledge relevant to dial design, 
especially as regards colour, upon which little previous work seems to have 
been done. The results suggest a principle of dial legibility which is discussed. 


§ 1. EXPERIMENT I 


Four areas were chosen for study : dial frame, dial hand, graduations and 
dial colour. 


Figure 2. (Labels: chamfer 2 mm, 30°; small upper arc A = 27,6 mm; large upper arc A = 29,6 mm.) 


Figure 3. 

For the first three areas, three variables, each with two levels, were 
investigated. For the fourth area (dial colour) there were two variables, one 
with two levels and one with four. Thus, for each area eight possible 
combinations were obtained, 2 × 2 × 2 for the first three and 2 × 4 for the last, 
as shown in Table 1. 

Table 1. Scheme of Experiment I 

1. DIAL FRAME 
   Variable 1. Saturation of grey: (a) Light (Munsell value N7); (b) Dark (Munsell value N4) 
   Variable 2. Upper arc of window: (a) Small; (b) Large (the dimensions are shown in Fig. 2) 
   Variable 3. Finish: (a) Matt; (b) Glossy 

2. DIAL HAND 
   Variable 1. Width of pointed end: (a) Thick (i.e. the same width as the major 
               graduations 0, 5, 10, 15, etc.); (b) Thin (i.e. the same width as the 
               minor graduations) 
   Variable 2. Length: (a) Long (i.e. overlapping the minor graduations); (b) Short 
               (i.e. touching but not overlapping the minor graduations) 
   Variable 3. Colour: (a) Same as major graduations; (b) Different from major graduations 

3. GRADUATION 
   Variable 1. Colours of majors and minors: (a) Same; (b) Different 
   Variable 2. Colour of minors: (a) All one colour; (b) Two colours 
   Variable 3. Height of minors (see Fig. 3): (a) All the same; (b) Differing 

4. DIAL COLOUR 
   Variable 1. Saturation: (a) Light; (b) Dark 
   Variable 2. Hue: (a) Neutral; (b) Yellow; (c) Green; (d) Blue 


The colours used for the graduations and dials are set out in Table 2. 


Eight subjects participated, 4 men and 4 women. Two of the men and 
two of the women had already worked with comparators; the other four 
subjects had no experience of this type of work. 


Figure 4. Graduation ‘S’ and graduation ‘G’. 


64 dials were constructed, representing the combinations of the eight 
graduations with the eight colours. These dials were in the form of circular 
disks which were placed in a vertical position on a screen at a distance of 
30 cm from the subject’s eyes and under an illumination of 240 lx. The dial 
frame and the hand could be fixed on the apparatus separately. On the 
experimenter’s side the apparatus consisted of a scale on which a pointer could be moved 
to the position corresponding to the dial hand on the subject’s side. A shutter 
controlled by an electric timer enabled exposures of 0.2 sec to be made. 

A factorial design randomised the following variables: subjects, dial 
frame, dial hand, graduations, colour of dial, time of day when readings were 
made, day on which readings were made, lists of readings. There were eight 
lists of numbers which constituted the readings to be made. 


Dia 
No. 


WIMP Coho 


HH eee ee 
OmnPWNrOO 


Dial hue 


pale yellow 


grayish yellow 


light blue 


ss 


TABLE 2. 
Munsell Hue of 
notation major 


NO 


2.5Y 8/4 


5Y 7/4 


10 G 8/2 


light greenish blue 7.5B.G7/6 


” 


dark blue 


2 


5.PB 2/2 


N1 


graduations 


black 


32 


white N9 


black NI 


Length of 
minor 
graduation 


same length 


different length 


2 


same length 
rE 


33 


different length 


same length 
2 


2 
fae 3 
different 


3 


length 


29 


23 


same length 


or 


different length 


same length 


different 


cry 


length 


same length 


2” 


2” 


different length 


” 


2 


same length 


” 


different 


same length 


2 


” 


different length 


a= Munsell notations appear once for the same colour. 


length — 


Hue of minor 
minor 

graduations 
below “5” s 

strong blue 

cherry red 

black 

cherry red 

black 

cherry red 

strong blue 

cherry red 


strong blue 
cherry red 
black 
cherry red 
black 
cherry red 
strong blue 
cherry red 


dark green 
cherry red 
white 

cherry red 
white 

cherry red 
dark green 
cherry red 


dark green 
cherry red 
black 

cherry red 

black 

cherry red 
dark green 
cherry red 


chinese orange 
cherry red 
black 

cherry red 
black 

cherry red 
chinese orange 
cherry red 


chinese orange 
pale blue white 
white 

pale blue white 
white 

pale blue white 
chinese orange 
pale blue white 


strong yellow 
strong blue 
white 

strong blue 
white 

strong blue 
strong yellow 
strong blue 


strong yellow 
strong blue 
white 

strong blue 
white 

strong blue 
strong yellow 
strong blue 


COLOURS USED IN EXPERIMENT I 


Munsell 
notation 


5PB 3/10 
7.5R/14 
Nl 


2.59 3/6 


a 


5R6/12 


Peevey erra Seerew 


5B 7/4 


oT SHpewyeoenwes 


-Y 8/10 


Senpreee 


popes pes 


There were eight 


The numbers 


Hue of minor 
minor 

graduations 

above “5’’s 


strong blue 
black 
black 
strong blue 
black 
strong blue 
strong blue 
black 


strong blue 
black 
black 
strong blue 
black 
strong blue 
strong blue 
black 


dark green 
white 
white 
dark green 
white 
dark green 
dark green 
white 


dark green 
black 
black 
dark green 
black 
dark green 
dark green 
black 


chinese orange 
black 
black 
chinese orange 
black 
chinese orange 
chinese orange 
black 


chinese orange 
white 
white 
chinese orange 
white 
chinese orange 
chinese orange 
white 


strong yellow 
white 
white 
strong yellow 
white 
strong yellow 
strong yellow 
white 


strong yellow 
white 
white 
strong yellow 
white 
strong yellow 
strong yellow 
white 


b=Munsell notations same as preceding column. 


Sign 


and 


b bottom 
b top 
b 


b bottom 
b 39 
b top 
1D os 
b bottom 


b bottom 
b top 


b 22 
b bottom 
bess 
b top 
b bottom 


b bottom 
b top 


b LE) 
b bottom 
bie 
b top 
b bottom 


b bottom 
b top 


1d 5: 
b bottom 
b ” 
b top 
Des 
b bottom 


bottom 
b top 


bottom 
top 
b bottom 


bottom 
b top 
The numbers were drawn from a table of random numbers normally distributed with a 
mean of zero and a standard deviation of 10. Each subject came once every 
day for eight consecutive working days and made 200 readings at each sitting. 
These gave a total of 12 800 readings. 
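
The total follows from the numbers of subjects, days and readings per sitting. The sketch below also shows how a list of readings with the stated distribution might be generated today. It is an assumption-laden illustration: a printed table of random numbers was used in the experiment, not a program, and the rounding to whole numbers, the seed and the function name are all hypothetical.

```python
import random

SUBJECTS = 8
DAYS = 8
READINGS_PER_SITTING = 200

total_readings = SUBJECTS * DAYS * READINGS_PER_SITTING
print(total_readings)  # 12800


def make_reading_list(n, seed):
    """Draw n readings from a normal distribution, mean 0 and s.d. 10.

    Rounding to whole numbers is an assumption made for illustration.
    """
    rng = random.Random(seed)
    return [round(rng.gauss(0, 10)) for _ in range(n)]


reading_list = make_reading_list(READINGS_PER_SITTING, seed=1)
print(len(reading_list))  # 200
```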

Parallel to the main investigation, two further types of graduation, shown 
in Fig. 4, were studied. The first (S) was a standard type currently used in 
comparators ; the widths of its major and minor graduations are the same 
but the majors are longer than the minors. 

The second type of graduation (G) had no minors. Both (S) and (G) 
appeared in the same colours as the other graduations and each subject made 
100 readings with each type and colour. The resulting 800 readings with 
(S) and 800 with (G) were randomly attributed, in sets of 100 readings, 
to the eight testing days. 


§ 1.1 Results


The subjects differed widely in the extent to which their performances 
improved over the eight days. Accordingly, a regression coefficient was 
calculated for each subject and a deviation score was obtained by subtracting 
the predicted score from the actual score. The deviation scores were used for 
analyses of variance. 
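
The detrending step described above can be sketched as follows. This is a minimal illustration, assuming an ordinary least-squares straight line of daily error counts on day number for a single subject; the function name `deviation_scores` and the sample counts are hypothetical and are not data from the experiment.

```python
from statistics import mean


def deviation_scores(errors_by_day):
    """Return actual minus predicted error counts for one subject.

    The prediction is a least-squares straight line of errors on day number
    (1, 2, ...), so the slope plays the role of the subject's regression
    coefficient (assumed form of the regression; the paper gives no formula).
    """
    days = list(range(1, len(errors_by_day) + 1))
    day_mean = mean(days)
    err_mean = mean(errors_by_day)
    # Least-squares slope and intercept of errors on day number.
    sxy = sum((d - day_mean) * (e - err_mean) for d, e in zip(days, errors_by_day))
    sxx = sum((d - day_mean) ** 2 for d in days)
    slope = sxy / sxx
    intercept = err_mean - slope * day_mean
    # Deviation score = actual score minus predicted score.
    return [e - (intercept + slope * d) for d, e in zip(days, errors_by_day)]


# Hypothetical error counts for one subject over eight days (illustration only).
example = [60, 52, 49, 41, 40, 33, 30, 27]
residuals = deviation_scores(example)
print([round(r, 2) for r in residuals])
```

Because each subject's own learning trend is removed, the deviation scores of a subject sum to zero, which is why they can be pooled across subjects who improved at different rates.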

The experimental design permitted the analysis of nine sources of variation. 
The first four were the areas of investigation and the other five were the sub- 
jects, the day, the time of day, the lists of numbers and the variation between 
the sessions with 200 and 300 readings. 

The total number of errors was 2542 over 12 800 readings. Thus about 
20 per cent of the readings were errors. 


Dial frame 


Results were found not to differ significantly for any of the three variables
studied, although there was a significant interaction between saturation and
finish (p < 0.05), indicating that a light glossy grey or a dark matt grey (1210
errors) were better than a light matt grey or a dark glossy grey (1332 errors).


Dial hand. Again results were found not to differ significantly for any of the
three variables studied. There was, however, a significant interaction
between length and colour (p < 0.05), indicating that a ‘long’ hand in the same
colour as the major graduations and a ‘short’ hand in a different colour
from them (1205 errors) were better than a ‘long’ hand in a different colour
from the major graduations and a ‘short’ hand in the same colour as them
(1337 errors).


Graduations 

Results for the first variable were significant (p < 0.05), indicating that a
different colour for the major and minor graduations was better than the
same colour for both types (1202 errors vs. 1340).


Dial colour


Again results for the first variable were significant (p < 0.01), indicating
that a light colour (1194 errors) was better than a dark one (1348 errors).
The interaction of saturation and colour was also significant (p < 0.01),
indicating that the variance between the eight colours was significant. The
percentages of errors for the eight colours were as follows:


                  per cent                         per cent
dark green           17        light green            21
pale yellow          17        dark blue              21
light blue           18        black                  22
white                19        greyish yellow         25


A difference of 2.6 per cent is necessary for significance at the
0.05 level.

It should be pointed out that the experimental design permitted the
analysis of results only within each area. An interaction between, say,
graduations and dial hand cannot therefore be tested statistically. What can
be done, however, is to combine the best results obtained within each area
and thus reconstitute a whole comparator.


Results with graduations ‘S’ and ‘G’. The following findings are based
on the original scores (errors) and not on the deviation scores. This is due to
the fact that no learning was possible with ‘S’ and ‘G’, since these two types
of graduation were presented only once to each subject. The percentage of
errors made with the eight graduations taken together is 25 (the highest
being 28 and the lowest 22). The percentage made with ‘S’ was 41, while
that made with ‘G’ was 17. The superiority of ‘G’ is in line with results
by Loucks (1944).


The figures seem to indicate that, compared with the other eight types of
graduation, ‘S’ and ‘G’ were the worst and the best, respectively. To what
extent, however, are these results due to chance? To answer this question,
the Wilcoxon matched-pairs signed-ranks test (cf. Siegel 1956) was applied
to the data as set out in Table 3.


Table 3. Percentages of errors made with different types of graduation in Experiment I


                                        Subjects
                           1    2    3    4    5    6    7    8
8 graduations together    26   29   29   34   38   16   14   16
graduation ‘S’            58   41   47   43   50   15   37   36
graduation ‘G’            19    8   20   13   28    8   23    8


When ‘S’ is compared to the eight graduations, Wilcoxon’s T = 1, which with
N = 8 has a probability of 0.02 (two-tailed test). Thus the results with ‘S’
can occur by chance twice in a hundred times. When ‘G’ is compared to the
eight graduations, Wilcoxon’s T = 4.5. With N = 8, Wilcoxon’s table, at the
chance level of 0.05 (two-tailed test), shows a T of 4. Thus the chance
probability attached to the obtained T of 4.5 remains indeterminate, but is in
the region of 0.05. The sign test (cf. Siegel 1956), applied to the same data,
shows that the chance probability for the ‘G’ results, when compared to the
eight graduations results, is p = 0.07 (two-tailed test). If only the 0.05
chance level is considered as being ‘significant’, then neither the Wilcoxon
test nor the sign test indicates a ‘significant’ result for the ‘G’ graduation
when compared to the eight other types taken together.
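
The two tests above can be retraced from the percentages in Table 3. The sketch below is an illustration only: the function names are hypothetical, tied differences are given average ranks (the usual convention, assumed here), and the ‘G’ value for subject 2 is read as 8 from a damaged scan. The statistic T and the sign count would be the same if that value were 18.

```python
from math import comb


def wilcoxon_t(x, y):
    """Wilcoxon matched-pairs T: the smaller of the two signed-rank sums."""
    diffs = sorted((a - b for a, b in zip(x, y) if a != b), key=abs)
    ranks = [0.0] * len(diffs)
    i = 0
    while i < len(diffs):
        j = i
        # Extend j over a run of tied absolute differences.
        while j + 1 < len(diffs) and abs(diffs[j + 1]) == abs(diffs[i]):
            j += 1
        average_rank = (i + j + 2) / 2  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1
    positive = sum(r for d, r in zip(diffs, ranks) if d > 0)
    negative = sum(r for d, r in zip(diffs, ranks) if d < 0)
    return min(positive, negative)


def sign_test_two_tailed(x, y):
    """Exact two-tailed sign test probability for paired scores."""
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    fewer = min(sum(d > 0 for d in diffs), sum(d < 0 for d in diffs))
    one_tail = sum(comb(n, i) for i in range(fewer + 1)) / 2 ** n
    return min(1.0, 2 * one_tail)


# Percentages of errors per subject, as read from Table 3.
eight_together = [26, 29, 29, 34, 38, 16, 14, 16]
grad_s = [58, 41, 47, 43, 50, 15, 37, 36]
grad_g = [19, 8, 20, 13, 28, 8, 23, 8]

print(wilcoxon_t(grad_s, eight_together))
print(wilcoxon_t(grad_g, eight_together))
print(round(sign_test_two_tailed(grad_g, eight_together), 2))
```

Only one subject goes against the trend in each comparison (subject 6 for ‘S’, subject 7 for ‘G’), which is why the sign test gives the same count for both; the Wilcoxon test separates them because the single reversal for ‘S’ is also the smallest difference.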


Other variables. Each day was divided into eight hours, one for each of
the eight subjects. The difference between the number of errors made in the
morning and in the afternoon (summed over subjects and over days) was not
significant. A second variable was the number of errors made at the beginning
of the half-day (first two hours a.m. and first two hours p.m.) compared with
those made at the end of the half-day (last two hours a.m. and last two hours
p.m.). This variable was significant at the 0.01 level, the errors being more
numerous at the beginning than at the end of the half-days.

Variation between results for different lists of numbers was significant at
the 0.05 level. The variation between numbers of errors made in sessions
with 200 and 300 readings was not significant.


1. Nature of errors made. Number of errors made on the right (positive)
and left (negative) sides of the scales. The errors were distributed in the
following way for each of the eight subjects.


Subject                         1 (M)  2 (M)  3 (M)  4 (M)  5 (F)  6 (F)  7 (F)  8 (F)  Total
Number of errors (right side)    352     ..     ..     ..     ..     ..     ..     ..    1801
Number of errors (left side)     302     ..     ..     ..     ..     ..     ..     ..    1744

(.. = figure illegible in the scan)


A binomial test applied to the errors made on the right and left sides
(1801 and 1744) resulted in z = 0.96 with p = 0.17. Thus the distribution
of errors on each side is not significantly different from a chance distribution.
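
The binomial test above can be checked with the normal approximation. The sketch below is illustrative: it assumes the approximation was used without a continuity correction and that the quoted probability is one-tailed, neither of which is stated in the text; the function name is hypothetical.

```python
from math import erfc, sqrt


def binomial_z(successes, n, p=0.5):
    """Normal approximation to the binomial (no continuity correction)."""
    return (successes - n * p) / sqrt(n * p * (1 - p))


right, left = 1801, 1744          # totals quoted in the text
z = binomial_z(right, right + left)
p_one_tailed = 0.5 * erfc(z / sqrt(2))   # upper-tail area of the normal curve
print(round(z, 2), round(p_one_tailed, 2))
```

Under these assumptions the sketch agrees with the values reported above; applying a continuity correction would lower z slightly.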


2. Systematic errors of 5 or 10 points. Were the errors made on each of
the different types of graduation of a special nature? For instance, was
there a tendency for errors of 5 or 10 points (plus or minus) to be made more
frequently on one type of graduation than on another? Such errors were
counted and more than 80 per cent were in the 5 points category, the
remainder in the 10 points; no 15 points errors were found. These errors
were grouped according to the eight subjects and the 10 graduations (8 x 10
table) and Friedman’s two-way analysis of variance was applied (cf. Siegel
1956). The result is χ² = 2.88 with 9 degrees of freedom and the chance
level is 0.95. We conclude that the errors of 5 or 10 points made on the 10
graduations were not different from what one would expect by chance.


§ 1.2 Discussion 


The striking differences found between the three types of graduations
(‘S’, ‘G’ and the other eight) suggest that the main information for correct
dial reading is given by the major graduations; the minors ‘inform’ only if
they do not ‘mask’ the information given by the majors. Thus the best
graduation was that of ‘G’ with no minors. The next best were those of the
eight main types in which majors and minors were distinct from each other in
weight and height, while the distinction in ‘S’, the worst type, was not at all
clear, the width being the same and only the 10s differing in length. This
principle is consistent with much previous work. Do the other significant
findings of the present experiment conform with it?

The finding that it was preferable to have majors and minors of two different
colours rather than the same colour, fits very well and does not need any com- 
ment. The finding that a light colour dial tended to be preferable to a dark 
one is of doubtful relevance. The major graduations were always either 
white or black, white when the dial colour was dark and black when the dial 
was light. Thus, there was always a contrast between the colour of the 
majors and the colour of the dial which formed the background ; it could, 
however, be argued that a black figure on a light background stands out 
better than a white figure on a dark background, since material in everyday 
use, such as printing, is of the former kind. If so, the significant finding about 
saturation of dial colour can perhaps fit in with the principle. The three
significant interactions in the Dial Frame, Saturation and Hand areas do not 
fit in with the principle, but they do not contradict it either. They seem to 
be independent. 

The principle is thus fairly well supported as far as the present data go. 
There are, however, a number of possible objections due to the limits of the 
experimental design, of which two may be especially mentioned. The first 
is the short exposure time of the stimuli : are the results obtained a function 
of this experimental condition ? The second objection is that in readings 
presented to the subjects, the dial hand was always exactly on a unit and 
never between two units. Both these objections have some force. Although 
it could be argued that the dial which has given proof of its comparative 
advantages under these conditions will also show its superiority under more 
‘normal’ ones, discrepancies between the findings of previous investigations 
indicate the need for caution. The second experiment was designed to
overcome both of these objections and at the same time to cover a greater 
variety of graduations. 


§ 2. Experiment II


In the second experiment 16 different dials were used, a dial being a 
combination of one of eight different types of graduation and one of 2 types 
of hand. Fig. 5 shows each of these types and the symbols used to denote 
them. The dials are shown in Fig. 5 at approximately full size. 

Sixteen subjects, eight women and eight men, participated; each came
twice and at each sitting made 160 readings, 10 for each of the 16 dials. These
readings were random numbers with one decimal and the subject had to round
off to a unit (e.g. 10.4 was 10; 10.6 was 11; there were no 0.5s). The same
160 readings were used for each subject so that the difficulty was uniform.

The order in which the 16 dials were presented differed for each subject.
A Latin square of 16 x 16 was used, each subject having the order of
presentation of a row: thus, over the 16 subjects each dial preceded and
followed each one of the other dials. In the second sitting for each subject
the dials were presented in the reverse order.
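
A square with the property described above, in which every dial is immediately followed by every other dial exactly once, can be built in several ways. The text does not say how its square was constructed; the sketch below uses the Williams construction as one standard possibility, with dials labelled 0 to 15 and hypothetical function names.

```python
from collections import Counter


def balanced_latin_square(n):
    """Williams-type Latin square for an even number n of treatments.

    The first row is 0, 1, n-1, 2, n-2, ...; every other row adds a constant
    (mod n). Each treatment then immediately follows every other one once.
    """
    if n % 2:
        raise ValueError("this construction needs an even number of treatments")
    first, low, high = [0], 1, n - 1
    while len(first) < n:
        first.append(low)
        low += 1
        if len(first) < n:
            first.append(high)
            high -= 1
    return [[(t + shift) % n for t in first] for shift in range(n)]


def immediate_successions(square):
    """Count how often each ordered pair (a, b) occurs with b right after a."""
    pairs = Counter()
    for row in square:
        pairs.update(zip(row, row[1:]))
    return pairs


square = balanced_latin_square(16)      # one row per subject
counts = immediate_successions(square)
print(square[0])
print(len(counts), set(counts.values()))
```

Reversing each row, as was done for the second sitting, keeps this balance, since the reversed square contains the same ordered pairs with their roles exchanged.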

The dials were shown to the subjects by means of a film. The subject,
seated in front of a screen, pressed a lever opening a small shutter in the middle
of the screen on which a dial was projected. The subject was instructed to
take all the time necessary for the correct reading, then call out the number
and depress the lever which he still held. This closed the shutter; the opening
and shutting times (in 1/100 sec) were automatically recorded on paper; if a
mistake was made in the reading this was recorded by the experimenter. A
study by Elkin (1959) has shown that the time to verbalise is a constant,
regardless of difficulty in reading. We therefore have two criteria: time taken
to make a reading and number of errors.


Figure 5. Dial and dial hands used in Experiment II.


§ 2.1 Results


Times. The distribution of the reading times was not normal, as is
frequently the case with time measurements. A logarithmic transformation
was therefore used and the resulting scores were subjected to an analysis of
variance, the results of which are shown in Table 4. The F obtained for the
variation between dials being highly significant, t-tests were carried out to
compare performances with the individual dials.
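
The effect of the logarithmic transformation can be illustrated briefly. Averaging log times and transforming back gives a geometric mean, which is less affected by an occasional very long reading than the arithmetic mean. The sketch below is an illustration only; the times are hypothetical values in 1/100 sec, not data from the experiment.

```python
from math import exp, log


def geometric_mean(times):
    """Mean of the log times, transformed back to the original unit."""
    logs = [log(t) for t in times]
    return exp(sum(logs) / len(logs))


# Hypothetical reading times in 1/100 sec, with one slow reading.
times = [150, 160, 175, 400]
arithmetic = sum(times) / len(times)
print(round(arithmetic, 1), round(geometric_mean(times), 1))
```

The single slow reading pulls the arithmetic mean up more than the geometric mean, which is the usual reason for analysing reading times on a log scale.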


Table 4. Experiment II. Analysis of variance of log times


Source of variation            Degrees of     Mean
                               freedom        square       F         P
Subjects (S)                        15        5.6859     302.44     0.01
Dials (D)                           15        0.8911      47.40     0.01
Ciphers in groups of 10 (C)         19        0.4363      23.21     0.01
Interactions
  S x D                            225        0.0503       2.68     0.01
  S x C                            285        0.0215       1.14     n.s.
  D x C                            285        0.0151       0.80     n.s.
Error                             4275        0.0188
Total                             5119        0.4082
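
The F column of Table 4 can be retraced by dividing each mean square by the error mean square. The sketch below is a check under stated assumptions: every effect is taken to be tested against the error term, and the mean square for subjects is read as 5.6859 (the scan shows no decimal point, but this reading is the one consistent with the printed F). The variable names are hypothetical.

```python
# Degrees of freedom, mean squares and printed F ratios as read from Table 4.
table4 = {
    "Subjects (S)": (15, 5.6859, 302.44),
    "Dials (D)": (15, 0.8911, 47.40),
    "Ciphers in groups of 10 (C)": (19, 0.4363, 23.21),
    "S x D": (225, 0.0503, 2.68),
    "S x C": (285, 0.0215, 1.14),
    "D x C": (285, 0.0151, 0.80),
}
ERROR_DF, ERROR_MS = 4275, 0.0188


def f_ratio(mean_square, error_ms=ERROR_MS):
    """F ratio of an effect tested against the error mean square."""
    return mean_square / error_ms


for source, (df, ms, printed_f) in table4.items():
    print(f"{source:30s} df={df:4d}  F={f_ratio(ms):7.2f}  (printed {printed_f})")

# The degrees of freedom should add up to the total of 5119,
# i.e. 16 subjects x 2 sittings x 160 readings, minus one.
total_df = sum(df for df, _, _ in table4.values()) + ERROR_DF
print(total_df)
```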


The mean times for the various dials are given in Table 5. The 16 dials
were divided into two classes, one with the normal dial hand and the other
with the wider hand. The difference between these two classes was not
significant. The 16 dials were accordingly grouped in the eight graduation
categories and t-tests were applied. Table 6 shows the results obtained. The
eight dials can be grouped into four categories, the first two with three dials
each and the last two with one each, as shown by the vertical lines in Table 6.
The differences between dials in the same category are not significant, but
those between one category and another are (except for ‘S’ and ‘G’ in groups
2 and 3).


Errors. Table 7 shows the number and per cent of errors made on each of
the eight graduations. Friedman’s two-way analysis of variance was applied
to the data cast in a 16 x 8 table (subjects and graduations) and gave χ² = 270,
which has a chance probability below 0.01. This test tells us that the
differences in numbers of errors are not due to chance, but does not specify
how large a difference between two graduations can still be considered due
to chance.

The Wilcoxon test (cf. Siegel 1956) was applied to pairs of graduations and
the resulting T’s showed the following graduations as being different at the
0.05 level (from right to left in Table 7):

Z and G
G and S
S and / (S and A not being different)
There were no significant differences between the first five graduations.


We could thus form four groups of graduations : 


Some characteristics of Instrument Measuring Dials


Table 5. Experiment II. Mean times
[Geometric mean time in 1/100 sec for each of the 16 dials, by type of graduation and dial hand. The table was printed sideways; its values and footnote are not reliably legible in the scan.]


Table 6. Experiment II. Grouped mean times
[Geometric mean time in 1/100 sec for each of the eight types of graduation, with vertical lines separating the four groups. Values and footnote are not reliably legible in the scan.]


Table 7. Experiment II. Errors

Type of graduation     (first five types: symbols illegible)         S       G       Z
Number of errors         20      22      23      25      30       41      76     122
Per cent of errors      3.1     3.4     3.6     3.9     4.7      6.4    11.9    19.1

[Footnote: difference in number of errors needed for significance at the 0.05 level; the figure is illegible in the scan.]


180 Antoine Papaloizos 


Each of these groups is significantly different from its adjacent groups 
except that A is not different from either the first or the second group and has 
therefore been listed in both. 


Relation between errors and time. A rank correlation coefficient (Spearman’s
rho) between the number of errors and time taken was computed for the 16
dials. A highly significant (0.01 level) positive correlation of 0.79 was
obtained, i.e. the dials with a short reading time also had few errors.
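
For readers who wish to repeat this kind of analysis, Spearman's rho can be computed as the ordinary correlation between the two sets of ranks. The sketch below is a generic illustration with hypothetical function names and made-up figures for five dials; the per-dial data of the experiment are not reproduced here.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j + 2) / 2  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Made-up mean times (1/100 sec) and error counts for five dials.
mean_times = [160, 165, 172, 181, 240]
error_counts = [12, 9, 20, 15, 60]
print(round(spearman_rho(mean_times, error_counts), 2))
```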


§ 2.2 Discussion


At the end of the first experiment we stated that the major graduations 
provided the main information in dial reading and that the information 
provided by the minor graduations should not ‘mask’ that of the major. 
Are the results of the second experiment compatible with this statement ? 

Let us consider the groupings of Table 6 (those of Table 7 are quite parallel).
The main difference between groups 1 and 2 lies in the fact that in the former
the major graduation is black while in the latter it is white (A and W vs and
A and V7); the background, for all graduations, being a white semi-transparent
paper, the black graduation will be more apparent than the white one.
In groups 1 and 2 we also find ‘A’ and ‘S’. These two graduations are also
differentiated by the width of the majors (cf. Fig. 5). The majors in group 1
are thus in all cases more prominent than the minors, while in group 2 they are
either less prominent or very little different from the minors. Thus the first
two groups fit in perfectly with our statement above. In the third group we
find ‘G’, which was the best type of graduation in the first experiment. ‘G’
has no minor graduations, so that our statement about the information of
majors and minors is not supported, since it would have predicted a better
result for ‘G’.

The following table presents the results with ‘A’ and ‘G’ in the two
experiments.


          First experiment                  Second experiment
      Time of reading   Errors          Time of reading   Errors
                        per cent                          per cent
A        0.2 sec          25               1.65 sec       [illegible]
G        0.2 sec          17               1.89 sec          12


When ample time is given, there is a reduction in the number of errors
made for both ‘A’ and ‘G’, but this reduction is considerably greater for
‘A’ than for ‘G’. The relationship between reading speed and number of
errors for ‘G’ is worth further experimentation. With very quick readings
(0.2 sec) ‘G’ is the best type of the graduations we studied; but when the
subject takes his time (1.5 sec or more) ‘G’ is one of the worst types. This
finding is in line with those of Kappauf, Smith and Bray (1947) and Kappauf
and Smith (1948). What happens to ‘G’ when the reading speed is between
0.2 and 1.5 of a second? Does the percentage of errors made with ‘G’ fall
below 12 per cent at any point? In other words, is the relation between
speed and errors linear or curvilinear? This question may be important
when the situation calls for very quick readings to be made.

The results with graduation ‘Z’ (worst type) fit in with our statement
about the information of majors and minors, since they were in this type
relatively poorly differentiated. The results with ‘Z’ should, however, be
taken with some caution, as it was quite a different type of graduation from
the others studied, demanding a different perceptual set for its reading;
furthermore a fault in its design was observed during testing: when the dial
hand was on the separation between two zones (i.e. on 0.4 or 0.6) it extended
the black zone so that the reading was quite difficult. In readings involving
0.4 or 0.6 the hand should have overlapped the zone’s height by a fraction
only.


§ 3. CONCLUSIONS 


Results of both experiments can be summed up by saying that for correct 
and quick dial reading the relationship between the major graduations on the 
one hand and the minor graduations and dial on the other, should be one of 
‘figure’ and ‘ground’. By a figure-ground relationship we mean that the 
major graduations must clearly stand out of, and be distinct from, the rest of 
the field of perception. Although what is ‘ figure’ and what is ‘ ground’ are 
to some extent under the voluntary control of the observer, by focussing the 
attention on one part or another of the field of view, there are certain physical 
characteristics, such as boldness of lines, colour, contrast, etc., and organisation 
of the whole, which make some parts more prominent, more ‘ figural’ than 
others. In the case of dials, given two types of graduation, one having lines 
thicker or bolder in some other way than the other, and the same dial colour 
for both, the graduation with the bolder lines will be more prominent, more 
‘ figural ’ than the other. 

The special case of omitting altogether the minor graduations is
advantageous for very quick readings (0.2 sec) but not with ample time
(1.5 sec). In between these two reading speeds, the special case is still
indeterminate.




We wish to express our thanks to Mr. Jean Cardinet for the experimental 
design, to Mr. Ferdinand Maire for preparing the dials and to Mr. Roger 
Kramer for making the apparatus. 


Le présent article relate deux expériences entreprises pour apprécier la lisibilité de cadrans
destinés à équiper des appareils de mesure. Les résultats des deux expériences montrent qu’un
facteur favorise une lecture rapide et correcte. Ce facteur stipule que les graduations principales
(10 et 5) d’une part, et, d’autre part, le cadran et les graduations secondaires, se présentent dans
le même rapport qu’une figure sur un fond.


Zwei Experimente wurden ausgeführt zur Schätzung der Lesbarkeit von Zifferblättern. Die
Resultate beider Experimente zeigten die Wichtigkeit eines Faktors für fehlerfreies und schnelles
Ablesen: nämlich dass die Hauptunterteilungen (für 10 und 5) einerseits und das Zifferblatt und
die kleineren Unterteilungen anderseits in Verhältnis zu Figur und Grund sein sollen.


DIFFERENTIAL EFFECTS OF KNOWLEDGE OF RESULTS AND
DISCUSSION 


By B. T. Jensen, J. R. Tritton and D. N. Anderson


System Development Corporation 
Santa Monica, California 

A training experiment was initiated at a radar station of the United States 
Air Force. External and unanticipated events so attenuated the experimental 
conditions that the effects cannot be evaluated unambiguously. However, the 
change in performance of crews supports the notion that knowledge of results and 
discussions contribute to improvement by helping to develop understanding of 
the environment and common modes of response among members of a group. The 
paper describes the study and details the changes in performance in relation to the 
kinds of information available to the crews. 


§ 1. INTRODUCTION


For a number of years the United States Air Force Air Defense Command 
(ADC) has had a system training programme designed to improve the day-to- 
day proficiency of radar crews and to increase their flexibility in responding 
to new situations which might arise in the event of hostilities. As part of 
the programme, observers of a specific training exercise provide knowledge 
of results of the crew’s operations by reporting to the crew members regarding 
their achievement; the crew then discusses its work and procedures in an 
attempt to improve operations. Although this combination has been demon- 
strably effective, questions were raised as to the relative contributions of the 
knowledge of results and of the discussion sessions. An attempt was made 
to gain information in a field setting by experimentally manipulating the 
training programme in an operational radar station. Conditions beyond the 
experimenter’s control weakened the study, but it nevertheless yielded 
valuable observational data. 

The data do not lend themselves to refined statistical analysis, but do 
indicate that the report to the crews and the discussions had strong effects. 
Specifically, they aided in developing, among the crew members, a common 
agreement regarding responses and a common and correct ‘model of the 
environment’, in the sense of an understanding of requirements and the 
responsiveness of the external world to crew activities. 

The research was conducted in a military setting using ADC’s System 
Training Programme (STP). The study involved only four radar crews. 
However, we believe that intensive analysis of a small number of cases, when 
related to a conceptual framework based upon study of other cases, does lead 
to significant interpretation. The comments made in this paper draw upon 
considerable experience of the writers in developing programmes for, and 
monitoring and doing research upon, the training of man-machine systems 
both prior to and following the experiment reported here. 


§ 2. THE SETTING


The STP was based upon experimentation in the System Research Laboratory
of the RAND Corporation. There, subjects, organized as crews, were
presented with simulated radar information indicating all air traffic in a given
area (some of it supposedly hostile) and the crews would attempt to defend
the area by performing activities leading to the shooting down of the invaders.
Immediately after this type of exercise, observers reported to the crew on its
achievement relative to the stimulus; e.g., ‘The crew intercepted ten of 
twelve invader aircraft.’ The crews participated in problem-solving discussions 
after receiving the reports. Observers noted a marked improvement in the 
performance of the crews working under some of the experimental conditions. 
Further experiments and development work led to STP in which radar crews 
of the ADC worked at their tasks of responding to simulated air traffic appear- 
ing on radar scopes during a two-hour exercise which was followed by perform- 
ance information (called ‘ feedback ’), and discussions (called ‘ debriefings ’). 
Additional information regarding the programme and its development can be 
found in papers by Alexander, et al. (1959), Chapman, et al. (1959) and Goodwin 
(1957). 

Continued experimentation indicated ways in which feedback should be 
changed with subsequent gain in achievement by the crew in training. A 
field experiment utilizing real crews at a temporarily inactive radar station 
indicated clear-cut superiority of crews having the new feedback and debriefings 
as compared to those experiencing only the training exercises. 


§ 3. PLAN OF THE PRESENT STUDY


The present study examined performance of crews participating in the
training exercise under four sets of conditions designed to enable feedback
and debriefing to be studied separately and in combination as shown in Table 1.


                        Debriefing
                        Yes          No
Feedback     Yes        Crew FD      Crew F
             No         Crew D       Crew O


Table 1. Experimental Conditions


The observations were made at a newly-constructed radar station on the
California coast. The ADC had agreed to continue this particular station on
an inactive status (subject to changing defence needs) for about three months
to permit experimental modification of the usual programme. The Commander
of the station assigned thirteen enlisted men with one or more years of Air
Force experience to serve as training personnel and to collect the data required.
One or more civilian representatives of the System Development Corporation
were at the station at all times during the conduct of STP activities.

Standard materials in STP include a problem film which, when fed through
a special type of projector, creates a dynamic air picture on the radar scopes.
To aid in preparing feedback, the observer is provided with a set of maps
giving the position of each aircraft at each five minutes throughout the
two-hour exercise, scripts for simulated inputs and recording forms which list
‘critical’ flights (those of special concern in that exercise) and describe
characteristics of those aircraft. For this study, special films had been prepared
giving extra heavy traffic in the area to be defended by this station. The 
various films included a peacetime problem with situations requiring special 
action; for instance, an air-sea rescue mission looking for survivors. Other 
problems included wartime situations with ‘ invader ’ aircraft, and SAC flights 
on retaliation missions. Scripts were prepared for the simulators of adjacent 
stations indicating what information should be passed into the station from 
each of several agencies, and at what times. Since not all actions and decisions 
could be standardized in script form, a guide was prepared for the simulator 
of higher headquarters indicating what action he should take in regard to 
different items of information which might be passed to him. 


§ 4. SEQUENCE OF EVENTS


This section describes what actually happened, not what was planned. 
From a pool of enlisted men and junior officers, four radar crews were organized 
so that all were approximately equal in rank and experience of the members. 
Most of the members had no prior experience in operating crews. Each crew 
was introduced to the training programme by lectures covering the purpose 
and the nature of the programmes, and the geographical area of responsibility— 
that is, the areas to be defended by this crew and by crews at adjacent radar 
stations. The lectures were followed by two practice, or shakedown, sessions 
to permit each crew to effect its internal organization and to become familiar 
with the conduct of STP. 

Following these, each crew was given a rather heavy-load wartime problem
with many aircraft on the radar scope, a large proportion of which were invaders.
This exercise served as a pre-test and was repeated three months later.

Within two hours after the pre-test exercise for the fourth crew, the Air
Defense Command activated the station. It was then manned around the
clock, so that the crews which should have been trained in the experimental
programme were experiencing at least forty hours of live operations each week.
Furthermore, a number of new men, previously unassigned to operations,
were placed in the crews for training purposes; however, men did not serve
on more than one crew except one officer and one crew chief (enlisted man)
who were transferred shortly after the declaration of emergency.

After about ten days, ADC decided to continue the experimental training
programme, modified so as to permit continuous operations. This meant that
during each training exercise, part of the crew dealt with live traffic and handled
emergencies, and that on some occasions, STP was delayed or postponed
because of operational demands.

During the next few weeks all crews experienced approximately the same 
number of STP exercises, receiving feedback or debriefing as planned (see 
Table 1). However, we know that crews had meetings during regular shifts, 
particularly on the midnight turn, at which they discussed operations, and no 
attempt was made to stop these activities. Furthermore, because of opera- 
tional needs the regularly scheduled debriefings were many times cut short 
and were often not attended by one or more crew members. Another factor 
limiting the amount of time spent in debriefings was the fact that the training
personnel manned the station during debriefing time and applied pressure
upon the crews to return, particularly in the evenings when such relief indicated
the end of the day for the training people.


§ 5. DISCUSSION OF RESULTS


Because of equipment difficulties, which interrupted the exercise, and 
because four of the most capable persons were re-assigned on the morning 
of the post-test, the performance of Crew FD on the post-test was very poor— 
it was obvious to all that it was performing far below its usual level. For 
this reason, objective data on this crew have not been considered, although 
other observations of it contribute to the general discussion and interpretation 
of results. 

The main changes between pre- and post-tests in the performances of the
three remaining crews are set out in Table 2.


Function              Crew F    Crew D    Crew O
Detection
  Per Cent               *         †         ‡
  Delay                  *         ?         ?
Tracking                 †         †         †
Telling
  Required               ?         ?         *
  Not Required           ?         *         ?


* No change
† Performance on post-test more desirable
‡ Performance on post-test less desirable
? Symbol illegible in the scan


Table 2. Relative Improvement of Three Crews


Clearly the crews changed to different extents in the several component
functions of their task. In the following paragraphs we attempt to illustrate
how different understanding of the task led to the preoccupation of crews with
different functions, and how feedback affects perception of the task. All crews
had been informed that the primary mission of this station was the detection
and tracking of aircraft appearing in certain parts of the area visible by radar.
Inasmuch as all started at about the same level of skill, we could expect all
crews to gain an equal amount from practice. Deviation from uniformity may
therefore be reasonably attributed to the effects of the differing treatments
received by the crews.

(a) Detection. The only crew to improve in this function was Crew D, 
who had debriefing time to discuss operations. Fortunately for the crew, the 
approach of the crew leaders was, “What problems do we have which keep us 
from doing our job?” The discussions were aimed then at clarification of 
tasks, interpretation of regulations, and reducing the communication difficulties 
among members. Out of this kind of discussion one would expect a common 
understanding of goals and procedures to emerge. This was important because 
in all crews several persons performed the detection function, rotating in the 
position at the radar scopes; and at all times, at least two individuals observed 
different parts of the area simultaneously. The deterioration of Crew O at 
detection will be considered later. 

(b) Tracking. All three crews improved to about the same extent on the function of 
tracking after detection. Airborne objects reported by the observer were 
plotted with time of entry on a large plexiglass map in front of the room. All 
crew members could see this display; discrepancies and lack of currency were 
visible. One of the crew leaders had the specific responsibility of seeing that 
this display was current and accurate. It would be surprising if new crews 
did not improve in this highly visible task. Only one crew had feedback on 
tracking, but apparently the feedback contributed little under these conditions. 
(Papers by Berkowitz (1958) and by Geddes et al. (1959) provide data and 
comments regarding effects of making performance of a function visible.) 

(c) Telling. The effects of information about the job are seen in the function 
of telling (reporting air traffic to other stations). Table 2 shows that Crew F 
improved in telling required information. This crew, it will be remembered, 
received post-exercise reports on how many tracks it reported compared with 
the number that should have been told: in short, it was confronted regularly 
by evidence of inadequacies. 

Crew F also, however, plotted a great deal of traffic in its ‘free’ area (i.e. 
safe areas that did not need to be watched routinely). The crew worked 
very hard to plot and maintain traffic, but it did so at the expense of main- 
taining good defence; its energies were expended in unnecessary ways because 
no one in the crew really understood the nature of the world about them. 
This was probably due to the fact that the feedback included the information 
that the crew was making unnecessary reports but gave no information about 
which kind of traffic should be reported to other stations. 

Crew D, with no reminders about telling, became less efficient in performing 
several jobs requiring cooperation with other stations. Its attention seemed 
to be focussed on doing its more visible tasks. 

Crew O, on the other hand, with neither feedback nor debriefing, did not 
deteriorate in performing required telling. This difference from Crew D was 
due to the fact that Crew O did more telling than any other, although most of 
it was not required. This crew elected to do its task by ‘brute force’; the 
crew chief said to an experimenter, “We don’t know what we are supposed 
to do, so we do everything we can.” The chief of Crew O had worked with 
Crew F for one week during which time the crew had one training exercise 
with feedback which included a report on telling. Thus, this crew chief knew 
that telling was important but he did not remember the requirements, even 
though they had been stated to all crews at the beginning of the study. A 
similar situation existed with respect to detection; Crew O detected and plotted 
a great deal of air traffic but most of it was in the so-called free area. 

Looking at the data of Table 2 in terms of crews rather than functions, it 
can be seen that Crew O, having neither feedback nor debriefing, deteriorated 
in both Detection and Telling. Crew D, with debriefing, improved in Detection 
but deteriorated in Telling, while Crew F, with feedback, improved in Telling 
but not in Detection. 

Crew FD added to the understanding of effects of feedback. For this 
study, the feedback was quite detailed, not defining problems for the crew 
but giving facts from which members could induce problems. The leaders 
of this crew were extremely anxious persons (more so than in any other crew), 
more inclined to search for an excuse for a failure than to define problems and 
formulate solutions. In the present instance detailed feedback focused the 
group’s attention upon specific acts. Considerable buck-passing occurred 
inasmuch as the feedback engendered much hostility. This situation does 
not negate the observations that feedback can be helpful, particularly if the 
report is designed for the specific group. This matter is treated at some length 
in another paper (Tilton & Jensen 1960). Nor does it negate the findings that 
specific information can lead to considerable improvement in performance. 

The general results support the idea that feedback (knowledge of results) 
and debriefings (discussions) contribute to improvement in crew performance. 
It is believed that the major way in which these factors contribute is through 
aiding in the development of a model of the environment, in terms of which 
goals may be set and procedures worked out. In the absence of such a model, 
performance is likely to be undirected: the excessively vigorous approach 
of Crew O is an example of a reaction which was probably as satisfactory as 
was possible in the circumstances. The desire on the part of subjects for 
such a model has been indicated by subsequent laboratory studies in which 
we have noticed that considerable effort is directed towards the question, 
“What is the nature of the environment ? ” 


The research reported herein is being performed pursuant to a contract 
with the United States Air Force, Department of Defense. 


Une expérience d’entraînement a été instaurée dans une station de contrôle par radar aux 
États-Unis. Des conditions extérieures et imprévues ont atténué à tel point les conditions 
expérimentales qu’il a été difficile d’évaluer les effets de façon non ambiguë. Il semble toutefois 
que la connaissance des résultats et les discussions puissent améliorer les performances dans 
l’équipe en permettant une meilleure compréhension de l’environnement et des modalités de 
réponse chez les membres du groupe. Le présent article expose cette expérimentation et 
interprète les variations de la performance en fonction du genre d’information fourni aux équipes. 


Auf einer Radarstation der amerikanischen Luftwaffe wurde ein Trainingsexperiment in 
Angriff genommen. Äussere und unvorhergesehene Ereignisse beeinflussen die experimentellen 
Bedingungen so, dass die Wirkungen nicht unzweideutig bewertet werden können. Jedoch 
unterstützt die Arbeitsveränderung einer Mannschaft die Vorstellung, dass die Kenntnis der 
Ergebnisse und Diskussionen zur Verbesserung beiträgt, indem sie dabei helfen, die Umgebungsbedingungen 
und die typischen Reaktionen der Mitglieder einer Mannschaft zu verstehen. Diese 
Arbeit beschreibt die Studie und bespricht eingehend die Veränderungen in bezug auf die 
Informationen, über die die Mannschaft verfügte. 


LETTER TO THE EDITOR 


Dear Sir, 
FRANKLIN V. TAYLOR AWARD 


Nominations are now being sought for the recipient of the first Franklin V. 
Taylor Award for outstanding contributions in the field of engineering psy- 
chology. This award was established by Division 21 (Engineering Psychology) 
of the American Psychological Association in September 1960. It is planned 
to make the first presentation at the annual meeting of the Division in Sep- 
tember 1961. Membership in Division 21 is not a prerequisite for the nominee. 
The deadline for receipt of nominations is 1 July 1961. 


I shall be pleased to supply nomination blanks upon request. 
Yours etc., 
LEONARD C. MEAD, 
Chairman of the Award Committee. 


Tufts University, 
Medford 55, Mass., 
U.S.A. 


BOOK REVIEWS 


Engineering Management. By STRUAN A. ROBERTSON. (Glasgow: Blackie 
& Sons Ltd., 2nd edition, 1960.) [Pp. xii+460.] 30s. 

THE purpose of the book is to provide a basis for a course of lectures suitable for 
degree and Higher National students in preparation for the examination of the 
Institution of Mechanical Engineers in Industrial Administration. However, 
even the general reader interested in management should find much wisdom and 
inspiration on many pages. Management is sub-divided into three parts: The 
Art of Management, The Science of Management, and The Practice of Management. 
The Art of Management emphasises the less tangible attributes for successful 
management, in particular the initiative required to create an enterprise and to 
inspire a staff to work with conscious desire for its continuing prosperity. The 
Science of Management includes chapters dealing with the structure of industry, 
the broad legal aspects of which a manager should be aware, and the developments 
of scientific management over the past two hundred years. The Practice of 
Management forms the bulk of the book. Commencing with the preliminary 
consideration to be taken before starting a new enterprise, the sections develop 
logically through policy formulation, company promotion and organization, thence 
to the factory, the work people, production, the customer and finally to the methods 
of overall control for checking the well-being of the business. In the section on 
work people there are short chapters on personnel selection, contracts of employment, 
T.W.I., technical and management education. Sound treatment is given 
to the problems of Industrial Health and Safety. Mention is also made of the areas 
in which Operations Research and Cybernetics can make effective contribution to 
management. J. CHERRY. 


People and Productivity. By A. B. WARING. (London: British Productivity 
Council, 1959.) [Pp. iv+25.] 25s. 

THE author, who is Chairman of Joseph Lucas (Industries) Ltd., is at present also 
Chairman of the British Productivity Council. His association with scientific 
management is of long standing and he is well known for his liberal approach to 
the human relations problems of industry. All this is reflected in this typographic- 
ally excellent booklet. Four important aspects of personnel management are 
described under the section headings: Factory Competitions, Workpeople’s Pro- 
duction Committees, Workpeople’s Suggestion Schemes, Support for the Foreman. 
Each section begins with a short homily which endeavours to set out the thinking 
behind the procedures described, and concludes with hortative warnings and advice 
derived, no doubt, from a hard school of experience. But the hard core of the 
sections, which is called “Administration Methods”, is really first rate material. 
A more practical guide could hardly be found for those approaching these personnel 
management practices for the first time. The sample procedures have been carefully 
thought out and are very well presented; they describe in some detail the 
routine work involved in carrying out the various schemes described. Mr. Waring 
can very well claim to have attained his declared object: “To demonstrate some 
practical methods by which the experience of human relations in industry can be 
applied to the advantage of production and the satisfaction of both the workpeople 
and the staff concerned with it.” Seymour Hits, 


Research Techniques in Human Engineering. By ALPHONSE CHAPANIS. (Baltimore, 
Md.: The Johns Hopkins Press, 1959.) [Pp. xi+316.] $6.00. 

IN common with a great many of his American colleagues, Dr. Chapanis is always 
able to demonstrate a high level of competence as an experimental psychologist 
which can only have developed from considerable capacity, sound training, wide 
reading and long experience. In short, he is a professional in the best sense of 
the word. He also has the less common ability to be both lucid and succinct 
while retaining an individual, somewhat colloquial style. We should, therefore, 
expect great things from this book, particularly since it is the first one of 
its kind to appear. Other authors have refrained from making the attempt 
on the grounds that research techniques can only be learned by doing research, 
so that, on reading this book one grasps the full significance of each sentence in the 
light of personal experience, or one sees it as a statement which is either incompre- 
hensible or obvious. Dr. Chapanis counters this kind of argument on the grounds 
that the tactics and strategy of research can be taught. This, of course, is different 
from teaching the research techniques of a particular subject. However, not only 
does the author know about the techniques but it emerges very clearly that he has 
actually used them so that the practical as well as the theoretical points are made. 
This is the real value of the book. It should convey a great deal to students who 
have just begun research and to research workers who wish to use a technique 
which has previously been outside their practice. For example, Chapter 2 may do a 
considerable service for British biologists in persuading them to pay more attention 
to what we could call Methods Study. These procedures are simple, quick and 
inexpensive, but still very effective and elegant if properly used. As a book for 
study by undergraduates most of the material can, perhaps, be justified at the 
advanced level of evaluating original research and for engineers one can hope that 
they will be persuaded to be rather more cautious in embarking on experiments 
in which human behaviour is involved. The chapters alternate between practical/ 
philosophical essays (Introduction, Methods for Study of Accidents, The Experimental 
Methods, Some Problems of Experimenting with People), well illustrated 
by experiments from the literature, and the more usual text-book material (Methods 
of Direct Observation, Statistical Methods, The Psychophysical Methods) which 
will serve as very useful introductions. The last chapter, “Articulation Testing 
Methods”, deals with a much more specialized subject, but it does serve to demonstrate 
by example the detailed care with which a particular topic of research in 
human engineering has very often been dealt with. Thus the book covers what 
little we know, to date, about the methods of experimental psychology, although 
to justify the title a much longer book dealing also with control engineering and 
physiological techniques would be desirable. W. T. SINGLETON. 


1. Articles for publication should be sent to one of the Editors or to any 
Member of the Editorial Board. 


2. Papers must be in English, French or German. Every paper must be 
accompanied by a brief summary, and contributors are asked if possible to 
supply summaries in all three languages. 


3. Authors should submit a typescript, double-spaced on one side of the 
paper only. Footnotes should be avoided. Summaries, tables and legends 

cost of any alteration in proof other than printers’ errors may be charged to 

half-tone illustrations, but diagrams not suitable for reproduction without 


5. References in the text should be indicated by author’s name followed by 
the date. They should be listed alphabetically at the end of the paper in the 
style illustrated by the following examples: 


BARTLETT, F. C., 1943, Fatigue following highly skilled work. Proc. 
roy. Soc. B, 131, 247-254. 
BEDFORD, T., 1948, Basic Principles of Ventilation and Heating (London: 
H. K. Lewis). 
LE GROS CLARK, W. E., 1954, The anatomy of work. In Symposium on 
Human Factors in Equipment Design (Edited by W. F. Floyd 
and A. T. Welford) (London: H. K. Lewis). Pp. 5-15. 


Abbreviations should be as in the World List of Scientific Periodicals. 


6. Consideration for publication will gladly be given to papers which 
have previously had a limited circulation as research reports. Submission 
of a paper implies, however, that it has not been published and will not be 
published elsewhere without the permission of the General Editor and the 
Publishers. Copyright in material accepted for publication is retained by the 
Journal, and reproduction in whole or in part is forbidden except under the 
terms of the Fair Copying Declaration of the Royal Society or with the written 
permission of the Publishers. 


7. Authors will receive 25 copies of their contributions without charge. 
Additional copies may be ordered at the time of returning proofs. Prices for 
additional copies may be obtained from the Publishers. 


8. Authors of papers of ergonomic interest published in other journals and 
of unpublished reports which are available on request, are invited to submit 
brief summaries to the Reviews Editor for inclusion in this journal. Summaries 
must be accompanied by a copy of the full paper. 